Preface


Most books of a technical nature arise from a specific need in some 
field of endeavor. In the new and growing field of operations research, 
students, teachers, and practitioners alike have felt the lack of a text- 
book containing illustrative problems and exercises. As far as we 
know, this book makes such material available to the general public 
for the first time. 

Much of the subject matter of this book arose out of a problems 
course in operations research, designed as a companion course to the 
general methods course offered at Case Institute of Technology. In 
developing any new course, according to the familiar law of least effort, 
one resorts to the techniques of collection, adaptation, and invention 
in that order. The resulting material will thus not be entirely original; 
it will, however, express the preference, judgment, and personality of 
those who do the work. 

The book is addressed to readers who have a working knowledge of 
the differential and integral calculus. Following the introductory ma- 
terial, which includes a problem-centered review of certain basic topics 
in probability and statistics, each chapter contains (1) the general 
theory and techniques of a particular problem area in operations re- 
search, (2) a number of completely solved problems demonstrating 
these techniques, and (3) a number of problems for the reader to solve, 
some with answers. Each of Chapters 4 through 10 is concerned with 
a separate area in operations research, and is complete in itself as a
teaching unit.


By following through the examples and solving the problems, a
reader of this book can develop skill in formulating and solving mathe-
matical models. He will learn little about such crucial phases of opera-
tions research as recognition of a problem, the collection of relevant
information, and implementation of a solution. These phases of opera-
tions research are perhaps better mastered through practical experi-
ence on real problems.

This work can serve as a textbook for a one-semester introductory 
techniques course, at either the graduate or the advanced undergradu- 
ate level. Because of its intermediate mathematical level and wide 
scope, it could also be used profitably by students majoring in such 
fields as industrial engineering, statistics, economics, and applied 
mathematics. 

Maurice Sasieni
Arthur Yaspan
Lawrence Friedman
April 1959
CHAPTER ONE


Introduction


One of the most remarkable phenomena of this century is the rapid 
growth in the size and the complexity of human organizations. The 
mere size of modern business means that administrative decisions can 
affect vast quantities of capital and large numbers of people. Mistakes 
can be tremendously costly, and a single wrong decision can require 
years for rectification. Moreover, the pace of modern business is such 
that decisions are required more rapidly than ever before; the mere post- 
ponement of action can give a decided advantage to a competitor. 

It is not surprising that the increase in difficulty of decision making
has been followed by efforts to put this activity on a more objective and 
routine basis. The growing science of operations research, which relies 
heavily on the methods that have proved so successful in the physical 
sciences, is one aspect of these efforts. 

From the point of view of operations research, a decision is a recom- 
mendation that a particular course of action, affecting the system, be 
carried out. The decision maker attempts to choose that course of 
action which is expected to yield the “best” results in terms of the
larger goals of the organization of which the system is a part. Another 
way of putting this is to say that the decision maker attempts to render 
the system more effective in furthering the goals of the organization. 
This notion of effectiveness, which will recur again and again through 
the book, is a descriptive term applied to the state of a system, much 
as “health” is a descriptive term applied to the state of a person. 
Generally, the decision maker attempts to increase the effectiveness of
the system under his control. 

Comparison of the effectiveness of different system states is facili- 
tated, though with some abuse to reality, by the choice of a measure of 
effectiveness: i.e. by the choice of some numerical scale for effectiveness. 
For example, the body temperature of a person is often taken as the 
measure of his health. The measure of effectiveness usually has charac- 
teristic units associated with it (e.g., dollars, dollars per month, defects 
per thousand items, degrees in the example above); but some commonly
used measures are dimensionless, such as the probability of destroying 
the target in an air attack. Henceforth we shall use the terms “ef- 
fectiveness” and “measure of effectiveness” interchangeably; and both 
will refer to the incomplete description of effectiveness that results 
when we introduce a numerical scale. It is not the purpose of this book 
to discuss the problems involved in specifying a measure of effective- 
ness. Notwithstanding this omission, it remains true that a major part 
of most practical operations research studies consists of identifying 
the decision makers, discovering their (possibly conflicting) objec- 
tives, and translating these objectives into points along compatible 
scales. 

Once the effectiveness scale has been specified, the task becomes one 
of choosing from the available courses of action the one that maximizes
the resulting effectiveness. In the simplest case, the effectiveness is a 
uniquely determined function of the course of action, and the appropri- 
ate course can be chosen by an application of standard mathematical 
techniques. Usually, however, the situation is more complicated, in 
that the effectiveness depends not only on the course of action, but also 
on some list of external variables whose values are beyond the control 
of the decision maker. These external variables may be continuous or 


have to be included in the mathematical model relating the system 
effectiveness to the courses of action. After the decision problem has 


In order to clarify the role of mathematical models in operations re- 
search, it is helpful to look at the role of such models in the more
familiar context of mechanics. The physics student is presented with
a series of propositions concerning the behavior of certain ideal systems. 
He finds that these propositions, expressed in mathematical symbols,
form a model of the real world which can be manipulated far more easily 
than the real world itself. 

Newton’s three laws of motion, together with a postulated force 
called friction which opposes motion, give a mathematical model of the 
physical world which accounts reasonably well for many observed 
phenomena. Other phenomena cannot be explained in this simple way; 
depending on our purpose, we may have to consider air resistance, 
earth rotation, electrostatic charges, and other factors. Each succeed- 
ing variable added to the problem enables us to represent the physical 
world more closely, at the cost of greater and greater complexity in 
analysis. How complicated should the model be? This is a matter of 
personal preference; the fashion is to choose the simplest model that 
answers the type of questions we wish to ask about the system. 

This book has been written in the hope that it will play the same role 
in the training of the operations research worker as a text on basic 
physics plays in the training of the engineer. It deals with a selection 
of the mathematical techniques that have been found useful in opera- 
tions research. The approach is mainly through the medium of worked 
examples, against a background furnished by some discussion of the 
relevant problem area. The problems discussed are simple—many so 
simple that it is hard to imagine real situations to which they could be 
applied without drastic refinement. Apart from added complexity, 
real-life problems differ in another way from those of this book. For 
ease of presentation, problems are classified in chapters according to 
the basic function of the system in which they arise. Such a simple 
classification hardly corresponds to reality; in practice, some sort of 
synthesis of prototype models is required, with the resulting model 
adjusted to fit the real situation closely enough to solve the problems 
under discussion. In this connection, it should be pointed out that 
mathematical sophistication, though frequently necessary, has no merit 
per se. Although a great deal of elegant mathematics appears in the
literature, far more problems have been solved by the intelligent ap- 
plication of simple arithmetic. 

The reader who wishes to get the most out of this book should have 
some acquaintance with the main concepts and results of the calculus. 
In particular, he should be familiar with the routines for determining 
the extreme values of a function of one or more variables, and with the 
interpretation of a single or multiple integral as the limit of a sum. 
Occasional allusions are made to the calculus of finite differences; but, 
as this is less widely taught than ordinary calculus, the portion relevant 
to our purpose has been treated in an appendix. Finally, a modicum 
of statistics is assumed, equivalent to that which would be obtained by
a student in a one-semester course. 

The subject matter proper of this book begins with a brief survey of 
the elements of probability theory, including the notions of probability 
of an event, random variable, probability distribution, and conditional 
probability distribution. This material is basic for an understanding 
of some of the remaining chapters, and should serve as a good review 
even for those readers who are already familiar with probability. The 
succeeding chapter discusses sampling, and gives an introduction to the 
Monte Carlo method of simulated sampling. Chapters 4 through 10 
present various problem areas which have gradually taken distinct 
shape in operations research: in order, they are inventory, replacement, 
waiting lines, competitive strategies, allocation, sequencing, and dy- 
namic programming. 
Probability


Some familiarity with the basic notions of probability is essential 
for an understanding of the remaining chapters of this book. The information
on which decisions are to be based is usually in the form of
numbers; and these numbers, as well as the conclusions derived there- 
from, are usually doubtful to a greater or lesser extent. The theory of 
probability furnishes a means for expressing the degree of doubt in 
mathematical terms. 

A rigorous development of probability theory involves logical and 
mathematical difficulties of high order, and will not be attempted here. 
Instead, we shall present working definitions of such concepts as “ran- 
dom variable” and “probability distribution,” and illustrate their 
application with examples. The chapter is intended to serve as a brief
review for those readers who have an understanding of probability 
theory, and as a guidepost to further study of the subject for those 
who come to this book with less preparation. Several good references 
are given at the end of the chapter. 


SIMPLE EVENTS 


Suppose that within a particular limited environment the possible
states of nature are E_1, E_2, ..., E_n, and that the environment periodically
changes from one state to another in some random fashion. For
example, the environment might be a coin lying on the ground with an
observer standing beside it; the relevant set of states might be “heads
up” and “tails up”; and possible state changes could result from the
observer’s picking up the coin and flipping it in the air. We shall call
the various possible states E_i “simple events” and shall refer to the
action precipitating a possible state change as a “trial.”

Ignoring the philosophical difficulties involved in such a definition,
we shall define the probability of the simple event E_i as “an estimate
of the value approached by the ratio

    (Number of occurrences of E_i) / (Number of trials)

as the number of trials tends to infinity.” We will write P(E_i) for the
probability of the event E_i.

For definiteness, let us focus attention on the simple event E_1.
Clearly P(E_1) is a number between 0 and 1 inclusive. If E_1 is certain
to occur at each trial, then the ratio above will be unity at the end of
each trial, and we have P(E_1) = 1. If the occurrence of E_1 is impossible
at each trial, then the ratio will be zero at the end of each trial,
and we would have P(E_1) = 0. If E_1 is considered more likely to occur
than E_2, then our estimate of the number of occurrences of E_1 in a large
number of trials should be larger than the estimate of the number of
occurrences of E_2; and thus P(E_1) > P(E_2). Also, if our estimating
procedure is to be consistent, we must have

    P(E_1) + P(E_2) + ... + P(E_n) = 1

since the sum over i of the number of occurrences of E_i in N trials must
equal the number of trials.

In some situations, such as in the toss of an unbiased coin or a true
die, we might feel that our estimate of the number of times the state E_i
will appear in a large number of trials should be the same for each i.
In such cases we would have

    P(E_1) = P(E_2) = ... = P(E_n)

and, since the sum of the probabilities for the various simple events
must be unity as above, we would then have

    P(E_i) = 1/n        for each i

In tossing an unbiased coin, this would imply that the probability of a
head (or a tail) would be assigned the value 1/2; in the case of a true
die, the probability of a 1 (or any other positive integer between 1 and
6) would be taken as 1/6.
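
The limiting-ratio definition can be illustrated by simulation. The following is a minimal sketch in Python and is not part of the original text; the function name relative_frequency, the fixed seed, and the use of a pseudo-random generator in place of a physical die are assumptions made for illustration.

```python
import random

def relative_frequency(faces, target, trials, seed=0):
    """Ratio (occurrences of `target`) / (number of trials) in a simulated run."""
    rng = random.Random(seed)  # assumed: fixed seed so the run is repeatable
    hits = sum(1 for _ in range(trials) if rng.choice(faces) == target)
    return hits / trials

die = [1, 2, 3, 4, 5, 6]
for n in (100, 10_000, 100_000):
    # For a true die the ratio should drift toward 1/6 as n grows.
    print(n, relative_frequency(die, 1, n))
```

Any finite run only approximates the limit, so the printed ratios should be read as estimates of P(E_1) = 1/6 rather than as exact values.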

For some purposes it is convenient to construct a geometric model 
of the situation under discussion, by identifying the realization of the 
state E_i with a point B_i in some geometric space. The outcome of a
trial is then some point in the so-called “sample space” made up of the
n points B_1, B_2, ..., B_n; and the probability associated with any point
B_i is defined as the probability of the corresponding simple event E_i.
Though we shall not go into the subject very deeply, the idea of a 
sample space is fundamental in probability theory. The two main 
properties of a sample space are: (1) each conceivable outcome of a 
trial is represented by one and only one point in the corresponding 
sample space, and (2) each point of a sample space has associated with 
it a non-negative number called the probability of the corresponding 
simple event, with these probabilities adding to unity. 

As an example, suppose that the environment consists of three coins 
on the ground with an observer beside them, and that a trial consists 
of tossing all three coins. There are eight possible states, viz.:

    H, H, H        T, H, H
    H, H, T        T, H, T
    H, T, H        T, T, H
    H, T, T        T, T, T

(Here, for example, the outcome H, T, H means “heads for coin 1, tails 
for coin 2, heads for coin 3.”) If we have no reason to expect the more 
frequent appearance of heads than tails in one toss of one coin, then 
the eight possible states could be regarded as equally likely to occur 
after a trial, and we would associate the same probability (1/8) with 
each of the possible simple events. The sample space could be taken 
as simply eight points B_1, ..., B_8 strung out along a line; but the symmetric
arrangement of Figure 2.1 is perhaps more suggestive.


Figure 2.1. Sample space where a trial is a toss of three coins.
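
The three-coin sample space can also be written out mechanically. The sketch below is an illustration only; the names sample_space and prob are assumptions, and the equal weights encode the assumption of unbiased coins made in the text.

```python
from fractions import Fraction
from itertools import product

# Each sample point is a tuple such as ('H', 'T', 'H'): coin 1, coin 2, coin 3.
sample_space = list(product("HT", repeat=3))

# Equally likely simple events: every point receives probability 1/8.
prob = {point: Fraction(1, len(sample_space)) for point in sample_space}

for point in sample_space:
    print(", ".join(point), prob[point])
```

The two properties of a sample space stated above can be checked directly: each outcome appears exactly once in sample_space, and the values in prob add to unity.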
If a (single) trial consists of the toss of two dice, then each die can
show one of the faces 1 through 6; so there are 36 possible outcomes, 
viz.: 

    1-1    2-1    3-1    4-1    5-1    6-1
    1-2    2-2    3-2    4-2    5-2    6-2
    1-3    2-3    3-3    4-3    5-3    6-3
    1-4    2-4    3-4    4-4    5-4    6-4
    1-5    2-5    3-5    4-5    5-5    6-5
    1-6    2-6    3-6    4-6    5-6    6-6


If the dice are true, all the outcomes are regarded as equally likely, and 
the probability associated with each would be 1/36. The outcomes can 
be conveniently represented by the points of a 6-by-6 lattice in two di- 
mensions, as illustrated in Figure 2.2. 


Figure 2.2. Sample space where a trial is a toss of two dice.


COMPOSITE AND RELATED EVENTS 


The simple events discussed in the previous section are the set of 
mutually exclusive and exhaustive states that could occur as the result 
of a trial. Our first generalization is the notion of an event G which 
occurs along with any one of a number of designated simple events. 
For example, if the trial is a throw of two dice, the event G might be 
“The sum of the points on the two dice is seven.” Then G occurs along 
with the following six states: 


1,6 2,5 3,4 
4,3 5,2 6,1 




Alternatively, one may say that this event G occurs at the six points
along the main diagonal in the sample space of Figure 2.2. The proba- 
bility of such a derived event is defined in the following way. 

The probability P(G) of the event G which occurs at each of a set of 
points in sample space is defined as the sum of the probabilities of all the 
points in the set. 

Thus the probability of obtaining a “seven” in one throw of two dice 
is the sum of six probabilities, each of value 1/36, or 1/6. This example 
illustrates an immediate consequence of the above definition. If all the 
points in the sample space have the same probability, then the probability
of the derived event G is simply the ratio of the number of points
at which G occurs to the total number of points. 
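
The counting rule for equally likely points lends itself to a short check. The following sketch assumes two true dice; the helper name probability and the predicate sum_is_seven are illustrative and do not come from the text.

```python
from fractions import Fraction
from itertools import product

dice_space = list(product(range(1, 7), repeat=2))  # the 36 lattice points

def probability(event, space):
    """P(event) as (points where the event occurs) / (total points)."""
    favourable = [pt for pt in space if event(pt)]
    return Fraction(len(favourable), len(space))

def sum_is_seven(pt):
    return pt[0] + pt[1] == 7

print(probability(sum_is_seven, dice_space))  # prints 1/6
```

The rule applies only when all points carry the same probability; for unequal weights the sum of the individual point probabilities would be needed instead.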

The next generalization deals with events that are defined (stated)
relative to two derived events G and H. There are two basic ways in
which this can be done. The event may be defined as the occurrence
of both of the events G and H, or it may be defined as the occurrence
of at least one of the events G and H. We shall describe events of the
former type as the intersection of G and H, and write such an event K as


    K = GH        (G and H occur)


Events K of the second type are described as the union of G and H, and
we write


    K = G + H     (either G or H or both occur)


The concepts of union and intersection apply equally well to simple
events, since a simple event is a special kind of derived event; however,
the intersection of two simple events is of little interest since it
consists of no sample points whatsoever.

To illustrate, let the trial be a throw of two dice, and let the events
G and H be as follows:


G: the sum of the two faces is seven or more. 
H: the first die shows one of the numbers 2, 3, 4. 


Then the event G occurs at points within the triangular-shaped region
of Figure 2.3; the event H occurs at points within the rectangular
region; the intersection GH occurs at points within the trapezoid; and
the union G + H occurs at all the points that lie either in the triangle or
in the rectangle or in both. By a count of points, we see that P(GH) =
9/36, and P(G + H) = 30/36.
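
The count of points for this illustration can be reproduced with set operations. This is a sketch under the assumption of two true dice; the names space, G, H, and P are chosen to mirror the text, and Python's & and | stand in for the intersection GH and the union G + H.

```python
from fractions import Fraction
from itertools import product

space = set(product(range(1, 7), repeat=2))

# G: the sum of the two faces is seven or more.
G = {(d1, d2) for d1, d2 in space if d1 + d2 >= 7}
# H: the first die shows one of the numbers 2, 3, 4.
H = {(d1, d2) for d1, d2 in space if d1 in (2, 3, 4)}

def P(event):
    """Probability of an event given as a set of equally likely points."""
    return Fraction(len(event), len(space))

print(len(G & H), P(G & H))  # 9 points, probability 1/4 (= 9/36)
print(len(G | H), P(G | H))  # 30 points, probability 5/6 (= 30/36)
```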

Figure 2.3. Intersection and union of two derived events.

The ideas of intersection and union can be extended to more than
two events. The intersection of n events is the set of sample points
common to all n events; and the union of n events is the set of all those
sample points which lie in at least one of the n events. Also, one may
speak of intersections and unions of events which are themselves com- 
posites of derived events, and so on. 

The probability of the event that is the union of two other events G
and H is, by definition, the sum of the probabilities of each of the
sample points in G + H. Conceptually we enumerate all the points
that lie in either G or H, being careful to include each point no more
than once, and arrive at the formula


    (2.1)    P(G + H) = P(G) + P(H) - P(GH)


If the events G and H have no sample points in common, the events are 
said to be mutually exclusive, and we have the simpler result 


    (2.2)    P(G + H) = P(G) + P(H)


In words, the last result says that, if two events are such that the oc- 
currence of one precludes the occurrence of the other, then the proba- 
bility of occurrence of one or the other is the sum of the individual 
probabilities of occurrence. 

Although equation (2.1) is a useful formula, it does no more than tell 
us how to compute the probability of the union of two events from the 
probability of their intersection, and vice versa. For applications, we 
need a better way of computing the probability of composite events 
than the complete enumeration of the sample space followed by the
count of the relevant sample points. Such a means is furnished by the
concept of conditional probability. 

The conditional probability of the event H, given the occurrence of 
the event G, is denoted by P(H|G), and is defined as the probability of 
occurrence of H in the reduced sample space obtained from the original 
sample space in the following manner: 


(a) The sample points in the new sample space are just the sample 
points comprising the event G. 

(b) The new probability associated with each simple event E_i in the
new sample space is the original probability P(E_i) divided by P(G).
(Then the sum of the new probabilities for the points in the new sample 
space will be unity as required.) 

Figure 2.4. Reduced sample space used in defining P(H|G).

The only points of H that are in the reduced sample space G are
those in the intersection GH, as shown in Figure 2.4. Thus the conditional
probability P(H|G) is, by definition, the sum of the new probabilities
for points in GH; in other words, it is the sum of the old probabilities
for points in GH, divided by P(G). So we have as our formula
for conditional probability:


    (2.3)    P(H|G) = P(GH) / P(G)        for any two events G and H with P(G) > 0


This formula is not only useful in its own right, but it also gives us the 
means for calculating P(GH), namely 
    (2.4)    P(GH) = P(G) P(H|G)


Because of the reduction in the sample space, it is usually much easier
to calculate P(H|G) first and then use formulas (2.4) and (2.1) than it is
to calculate the probabilities of the union and intersection directly from
the original sample space. 
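
The construction of the reduced sample space in steps (a) and (b) can be followed numerically. The sketch below reuses the two-dice events G (sum seven or more) and H (first die shows 2, 3, or 4) from the earlier illustration; the variable names are assumptions, and the final line compares the result with formula (2.3).

```python
from fractions import Fraction
from itertools import product

space = list(product(range(1, 7), repeat=2))  # two true dice, 36 points
weight = Fraction(1, len(space))              # original probability of each point

def in_G(pt):
    return pt[0] + pt[1] >= 7

def in_H(pt):
    return pt[0] in (2, 3, 4)

# Steps (a) and (b): keep only the points of G, divide each probability by P(G).
p_G = sum(weight for pt in space if in_G(pt))
reduced = {pt: weight / p_G for pt in space if in_G(pt)}

# P(H|G) is the sum of the new probabilities over the points of GH.
p_H_given_G = sum(p for pt, p in reduced.items() if in_H(pt))

# The same quantity from formula (2.3): P(GH) / P(G).
p_GH = sum(weight for pt in space if in_G(pt) and in_H(pt))
print(p_H_given_G, p_GH / p_G)  # both print 3/7
```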

It may happen that the conditional probability of H, given the oc- 
currence of G, is the same as the original (unconditional) probability 
of H. If this is true, the event H is said to be independent of the event 
G, and we have the simpler formula 


P(GH) = P(G) P(H) if H is independent of G 


If H is independent of G, then it can be easily shown that G is inde- 
pendent of H; so we may speak of the independence of two events G 
and H without regard to the order. Because of this symmetry, we may 
write 


    (2.5)    P(GH) = P(G) P(H)        for independent events G and H


In words, we say that, if two events are independent, then the proba- 
bility of their simultaneous occurrence is the product of their individual 
probabilities of occurrence. 
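
Whether two events are independent can be tested by comparing P(GH) with the product P(G) P(H). The sketch below does this on the two-dice sample space; the event "first die is even" and the helper names are hypothetical choices made for illustration, not examples from the text.

```python
from fractions import Fraction
from itertools import product

space = set(product(range(1, 7), repeat=2))

def P(event):
    """Probability of an event (a set of sample points) for two true dice."""
    return Fraction(len(event), len(space))

def independent(G, H):
    """True when P(GH) = P(G) P(H), the product rule (2.5)."""
    return P(G & H) == P(G) * P(H)

sum_is_seven = {pt for pt in space if sum(pt) == 7}
first_is_even = {pt for pt in space if pt[0] % 2 == 0}
sum_at_least_seven = {pt for pt in space if sum(pt) >= 7}
first_is_2_3_4 = {pt for pt in space if pt[0] in (2, 3, 4)}

print(independent(sum_is_seven, first_is_even))         # True
print(independent(sum_at_least_seven, first_is_2_3_4))  # False
```

The second pair is the G and H of the earlier illustration: there P(G) P(H) = 7/24 while P(GH) = 9/36, so the product rule does not apply and formula (2.4) must be used instead.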

The relationships (2.1), (2.2), (2.4), and (2.5) can be generalized to 
the case of n events. We give the generalizations for (2.2), (2.4), and 
(2.5), and refer the reader to Chapter 4 of Feller (see references at the 
end of the chapter) for the general statement of (2.1). 

If the events G_1, G_2, ..., G_n are mutually exclusive in pairs, then


    (2.6)    P(G_1 + G_2 + ... + G_n) = P(G_1) + P(G_2) + ... + P(G_n)

For any n events G_1, G_2, ..., G_n:

    (2.7)    P(G_1 G_2 ... G_n)
                 = P(G_1) P(G_2|G_1) P(G_3|G_1 G_2) ... P(G_n|G_1 G_2 ... G_{n-1})


If the events G_1, G_2, ..., G_n are independent (in the sense that the
occurrence of a G_i is independent of the occurrence of any of the
remaining events, and of any intersection of them), then


    (2.8)    P(G_1 G_2 ... G_n) = P(G_1) P(G_2) ... P(G_n)


From this point on, we shall regard the results (2.1) through (2.8) as 
valid, regardless of whether the sample space is or can be made explicit. 
In the examples that follow, the sample space will be enumerated only 
when that appears to be the most direct way of solving the problem in 
question. 
Example 1
A box contains 25 parts, of which 10 are defective. Two parts are
drawn at random from the box. What is the probability 


(a) that both are good? 
(b) that both are defective? 
(c) that one is good and one is defective? 


SOLUTION
(a) Let E be the event “the first part drawn is good,” and F be the
event “the second part drawn is good.” Then EF is the event “both
are good.”

    P(EF) = P(E) P(F|E) = (15/25)(14/24) = 7/20

(b) By similar reasoning, the probability that both are defective
is found to be (10/25)(9/24) = 3/20.

(c) Let E_1 = event “both are good”
        E_2 = event “both are defective”
        E_3 = event “one is good and one is defective”

Since at least one of these events must happen,

    P(E_1 + E_2 + E_3) = 1

But, since E_1, E_2, and E_3 are mutually exclusive,

    P(E_1 + E_2 + E_3) = P(E_1) + P(E_2) + P(E_3)

Thus

    P(E_1) + P(E_2) + P(E_3) = 1

    P(E_3) = 1 - P(E_1) - P(E_2)
           = 1 - 7/20 - 3/20 = 1/2

Another way of arriving at this result is to note that the desired
event is G + H, where

    G = event “first is good and second is defective”
    H = event “first is defective and second is good”

    P(G + H) = P(G) + P(H)        since the two events are mutually exclusive
             = (15/25)(10/24) + (10/25)(15/24) = 1/2
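
The arithmetic of Example 1 can be verified with exact fractions. This is a small sketch, with variable names chosen for illustration; it applies formula (2.4) to each part and formula (2.2) to part (c).

```python
from fractions import Fraction

good, defective = 15, 10          # 25 parts, of which 10 are defective
total = good + defective

# (a) and (b): P(first) * P(second | first), drawing without replacement.
both_good = Fraction(good, total) * Fraction(good - 1, total - 1)
both_defective = Fraction(defective, total) * Fraction(defective - 1, total - 1)

# (c): "good then defective" and "defective then good" are mutually exclusive.
one_of_each = (Fraction(good, total) * Fraction(defective, total - 1)
               + Fraction(defective, total) * Fraction(good, total - 1))

print(both_good, both_defective, one_of_each)  # 7/20 3/20 1/2
```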


Example 2
The ten digits 0, 1, 2, ..., 9 are arranged in random order. Find the
probability that the digits 3 and 4 are neighbors.
SOLUTION
One way for 3 and 4 to be neighbors is

    34XXXXXXXX

where the X’s represent the remaining numbers; another is

    X34XXXXXXX; etc.

One easily sees that there are exactly 18 such possibilities, viz.:

    34XXXXXXXX: Event E_1
    X34XXXXXXX: Event E_2
    ...
    XXXXXXXX34: Event E_9
    43XXXXXXXX: Event E_10
    X43XXXXXXX: Event E_11
    ...
    XXXXXXXX43: Event E_18

    P(E_1 + E_2 + ... + E_18) = P(E_1) + P(E_2) + ... + P(E_18)
                                    since the events are mutually exclusive
                              = 18 P(E_1)        since the events are equally likely
                              = 18 (1/10)(1/9) = 1/5
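
The counting argument of Example 2 can be checked by brute force on a shorter arrangement, where listing every ordering is cheap. The sketch below is illustrative only: the function names are assumptions, and the enumeration is run for 5 and 6 digits rather than 10 to keep it fast, while the closed form is evaluated for 10.

```python
from fractions import Fraction
from itertools import permutations

def neighbours_by_enumeration(n, a=3, b=4):
    """P(a and b adjacent) in a random ordering of 0..n-1, by listing all orderings."""
    favourable = total = 0
    for perm in permutations(range(n)):
        total += 1
        if abs(perm.index(a) - perm.index(b)) == 1:
            favourable += 1
    return Fraction(favourable, total)

def neighbours_by_counting(n):
    """2(n-1) equally likely placements, each with probability (1/n)(1/(n-1))."""
    return 2 * (n - 1) * Fraction(1, n) * Fraction(1, n - 1)

print(neighbours_by_enumeration(5), neighbours_by_counting(5))  # 2/5 2/5
print(neighbours_by_counting(10))                               # 1/5
```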

Example 3
A series of trials is conducted, where each trial must yield exactly
one of the events E_1 and E_2. If the probability that a trial yields E_1
is known to be p, and if the result of a trial is independent of the results
of previous trials, find


(a) the probability that n trials yield exactly k E_1’s.
(b) the probability that n trials yield at least k E_1’s.
(c) the probability that n trials yield at most k E_1’s.

SOLUTION


(a) If k is negative, or greater than n, the probability of the event
in question is obviously zero.

Let k be some integer of the set 0, 1, 2, ..., n. We first calculate
the probability of the event “the first k trials result in E_1, and the last
n - k trials result in E_2.” By equation (2.8), the probability of this
event is

    p^k (1 - p)^{n-k}

There are many other ways for the desired event to occur; all involve
the result E_1 for some set of k trials (not necessarily the first k trials),
and the result E_2 for the n - k trials not in this set. There are C_{n,k}
such ways, where

    C_{n,k} = n! / [k! (n - k)!]

and each of these ways is equally likely. Thus the probability P(k) of
the event “exactly k E_1’s in n trials” is given by

    P(k) = C_{n,k} p^k (1 - p)^{n-k}

(b) The event “at least k E_1’s in n trials” may be written as the
event (F_k + F_{k+1} + ... + F_n), where F_i is the event “exactly i E_1’s
in n trials.”

    P(F_k + F_{k+1} + ... + F_n) = P(F_k) + P(F_{k+1}) + ... + P(F_n)
                                       since the F_i are mutually exclusive

                                 = \sum_{i=k}^{n} P(F_i)

                                 = \sum_{i=k}^{n} C_{n,i} p^i (1 - p)^{n-i}        by part (a)


(c) In a similar fashion, we find that the probability of the event
“at most k E_1’s in n trials” is

    \sum_{i=0}^{k} C_{n,i} p^i (1 - p)^{n-i}
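
The three results of Example 3 translate directly into code. The following is a minimal sketch; the function names exactly, at_least, and at_most and the sample values n = 4, p = 1/2 are assumptions made for illustration. It relies on math.comb, available in Python 3.8 and later.

```python
from fractions import Fraction
from math import comb

def exactly(k, n, p):
    """P(exactly k E_1's in n trials) = C(n,k) p^k (1-p)^(n-k); zero outside 0..n."""
    if k < 0 or k > n:
        return 0
    return comb(n, k) * p**k * (1 - p)**(n - k)

def at_least(k, n, p):
    """Sum of exactly(i) for i = k, ..., n (part b)."""
    return sum(exactly(i, n, p) for i in range(max(k, 0), n + 1))

def at_most(k, n, p):
    """Sum of exactly(i) for i = 0, ..., k (part c)."""
    return sum(exactly(i, n, p) for i in range(0, min(k, n) + 1))

p = Fraction(1, 2)  # illustrative value, e.g. heads on an unbiased coin
print(exactly(2, 4, p), at_least(2, 4, p), at_most(2, 4, p))  # 3/8 11/16 11/16
```

Passing p as a Fraction keeps the results exact; a float works equally well when rounding error is acceptable.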


Example 4
Given the information that a family contains two children and that
at least one of these two children is a boy, find the probability that
both are boys.


SOLUTION
This is a problem in conditional probability, and is solved most
directly by considering the sample spaces. The original sample space
consists of the four points BB, BG, GB, and GG, each with the probability
1/4. The reduced sample space, comprising the points including
at least one boy, consists of the three points BB, BG, and GB, each
with the probability 1/3. So the answer to the problem is 1/3.

Note that this is not the same question as: Given that there are two 
children C and D and that C is a boy, find the probability that D is 
also a boy. The answer to the latter question is of course 1/2. 
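
The difference between the two questions in Example 4 shows up clearly when the reduced sample spaces are listed. The sketch below assumes, as the solution does, that the four points BB, BG, GB, GG are equally likely; the helper name conditional is hypothetical.

```python
from fractions import Fraction
from itertools import product

# Each point is (child C, child D); the four points are equally likely.
families = list(product("BG", repeat=2))

def conditional(event, given):
    """P(event | given) by counting points in the reduced sample space."""
    reduced = [f for f in families if given(f)]
    return Fraction(sum(1 for f in reduced if event(f)), len(reduced))

def both_boys(f):
    return f == ("B", "B")

print(conditional(both_boys, lambda f: "B" in f))     # at least one boy: 1/3
print(conditional(both_boys, lambda f: f[0] == "B"))  # C is a boy: 1/2
```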


Example 5 
A, B, and C each fire one shot at a target. The probabilities of each 
hitting the target are: 


    A: 0.30
    B: 0.25
    C: 0.10


If one bullet is found in the target, find the probability that it came 
from A’s gun; from B’s gun; from C’s gun. 


SOLUTION 

There are eight possible outcomes to a trial consisting of three shots, 
one by each of the marksmen. For simplicity, let 1 represent a hit and 
0 a miss; then 1-1-0 represents the event “A hits, B hits, C misses,”
etc. The eight possible outcomes together with their probabilities of 
occurrence after a trial are: 


    Outcome    Probability
    0-0-0      (.70)(.75)(.90)
    0-0-1      (.70)(.75)(.10)
    0-1-0      (.70)(.25)(.90)
    0-1-1      (.70)(.25)(.10)
    1-0-0      (.30)(.75)(.90)
    1-0-1      (.30)(.75)(.10)
    1-1-0      (.30)(.25)(.90)
    1-1-1      (.30)(.25)(.10)


The reduced sample space consists of the points 1-0-0, 0-1-0, and
0-0-1; and the new probabilities are the original probabilities, each
divided by the sum of the three probabilities. The conditional probability
of 1-0-0, given the occurrence of one of the events 1-0-0, 0-1-0,
0-0-1, is thus

    (.30)(.75)(.90) / [(.30)(.75)(.90) + (.70)(.25)(.90) + (.70)(.75)(.10)] = 27/55

The probability that the bullet lodged in the target should be credited
to A is 27/55. The probabilities for B and C are similarly found to be
21/55 and 7/55.
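
The reduced-sample-space calculation of Example 5 can be reproduced with exact fractions. This is a sketch only; the dictionary hit and the helper outcome_probability are illustrative names, and the shots are treated as independent, as the product form of the tabulated probabilities implies.

```python
from fractions import Fraction
from itertools import product

hit = {"A": Fraction(30, 100), "B": Fraction(25, 100), "C": Fraction(10, 100)}
names = list(hit)  # order A, B, C matches the outcome notation 1-0-0 etc.

def outcome_probability(outcome):
    """Probability of an outcome such as (1, 0, 0): A hits, B and C miss."""
    prob = Fraction(1)
    for name, h in zip(names, outcome):
        prob *= hit[name] if h else 1 - hit[name]
    return prob

# Reduced sample space: the outcomes with exactly one hit.
single_hits = [o for o in product((0, 1), repeat=3) if sum(o) == 1]
total = sum(outcome_probability(o) for o in single_hits)

# New probabilities: original probabilities divided by their sum.
credit = {names[o.index(1)]: outcome_probability(o) / total for o in single_hits}
print(credit["A"], credit["B"], credit["C"])  # 27/55 21/55 7/55
```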


Example 6 

A certain popular 5-cent confection comes in two varieties: Plain 
and Nut. The items are sold exclusively through jobbers. In order to 
have information on distribution at the retail level, the company in- 
vestigates a sample of randomly selected retail stores about once a 
year. Results are tabulated to yield the following three observed 
probabilities: 


a = probability that a store selected at random has Plain in stock 

b = probability that a store selected at random has Nut in stock 

c = probability that a store selected at random has at least one of 
the company’s products in stock (i.e. either Plain or Nut or both) 


[c ≥ a; c ≥ b; c ≤ a + b]


The company wishes to go back through its historical records, and 
infer the following numbers: 


a_1 = probability that a store selected at random normally carries
Plain (even if out of stock of Plain at the time of observation)

a_2 = probability that a store that normally carries Plain has Plain
in stock at a random instant

b_1 = probability that a store selected at random normally carries
Nut

b_2 = probability that a store that normally carries Nut has Nut in
stock at a random instant


(These four numbers cannot be calculated directly since the original 
data are unavailable.) 


SOLUTION

There are four unknowns a_1, a_2, b_1, b_2, and we attempt to find four
relations among them. We first note that a store has a variety in stock
if and only if it normally carries the variety and the variety is in stock
at the moment of observation. This immediately gives two relations:


    (2.9)     a = a_1 a_2
    (2.10)    b = b_1 b_2


In order to obtain two more relations, it is necessary to make further 
assumptions. Plain is the dominant variety of the two, and is the item 
with which the company is usually identified. The following 
assumptions are considered plausible: 


I. The probability that a store that normally carries Nut is momen- 
tarily out of Nut is twice the probability that a store that normally 
carries Plain is momentarily out of Plain. 

II. Any store that normally carries Nut also normally carries Plain. 


Assumption I gives us our third relation immediately: 

        1 - b_2 = 2(1 - a_2) 
or 
(2.11)  b_2 = 2a_2 - 1 

Assumption II permits us to relate c to the other quantities, viz.: 

c = P{a store has either Plain or Nut in stock} 
  = P{a store has Plain} + P{a store has Nut} 
    - P{a store has both Plain and Nut} 
  = a + b - P{a store has both} 

Now 

P{a store has both} = P{store normally carries both and has both} 
                    = P{store normally carries Nut and has both}    by Assumption II 
                    = b_1 a_2 b_2 
                    = a_2 b    from (2.10) 

The fourth relation is thus 

(2.12)  c = a + b - a_2 b 


The four simple equations (2.9) through (2.12) easily yield results 
of the type needed: 

a_1 = \frac{ab}{a + b - c},    a_2 = \frac{a + b - c}{b} 

b_1 = \frac{b^2}{2a + b - 2c},    b_2 = \frac{2a + b - 2c}{b} 


An idea of the way this works out numerically is given in the table 
below: 


Year of    Observed probabilities    Inferred probabilities 
survey     a      b      c           a_1     a_2     b_1     b_2 
1          0.37   0.15   0.40        0.467   0.800   0.250   0.600 
2          0.40   0.21   0.43        0.467   0.857   0.294   0.714 
3          0.43   0.26   0.45        0.495   0.923   0.307   0.846 
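
The inference can be carried out mechanically by solving relations (2.9) through (2.12) in turn. The following is a minimal sketch under Assumptions I and II of the text; the function name `infer_stocking` and the dictionary `surveys` are hypothetical, and the observed values are those of the table.

```python
def infer_stocking(a, b, c):
    """Solve (2.9)-(2.12) for a_1, a_2, b_1, b_2 given observed a, b, c."""
    a2 = (a + b - c) / b      # from (2.12): c = a + b - a_2 b
    a1 = a / a2               # from (2.9):  a = a_1 a_2
    b2 = 2 * a2 - 1           # from (2.11): b_2 = 2 a_2 - 1
    b1 = b / b2               # from (2.10): b = b_1 b_2
    return a1, a2, b1, b2

# Observed probabilities (a, b, c) for the three survey years in the table.
surveys = {1: (0.37, 0.15, 0.40), 2: (0.40, 0.21, 0.43), 3: (0.43, 0.26, 0.45)}
results = {year: infer_stocking(*obs) for year, obs in surveys.items()}
```

Run on the tabulated observations, this agrees with the table for a_2, b_1, and b_2 in all three years. The computed a_1 for years 1 and 3 (about 0.4625 and 0.4658) differs somewhat from the tabulated entries, which may reflect rounding or transcription in the table.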


PROBLEMS FOR SOLUTION 
Set I 


1. What is the probability that a bridge hand 


(a) will consist entirely of hearts? [Ans. 1.6 × 10^-12] 
(b) will contain no hearts? [Ans. 0.01] 
(c) will be made up entirely of one suit? [Ans. 6.3 × 10^-12] 


[Hint: (a) The probability that the first card drawn is a heart is 13/52. The 
probability that the second card is a heart, given that the first was a heart, 
is 12/51. Continuing in this manner, we apply equation (2.7) and end up with 
the product 

\frac{13}{52} \cdot \frac{12}{51} \cdot \frac{11}{50} \cdots \frac{1}{40} 


as the answer to part a. Part b is solved in a similar fashion. To solve part 
c, note that the event “all thirteen cards of one suit” is the union of the four 
mutually exclusive events “all spades,” “all hearts,” “all clubs,” “all dia- 
monds.”] 

2. There are 26 people at a cocktail party. What is the probability that at 
least two of them have the same birthday? [Ans. 0.60] 

[Hint: First calculate the probability that all 26 birthdays are different, 
using equation (2.7). Then subtract this probability from unity.] 


3. What is the probability of throwing exactly three sevens in six rolls of 
two dice? [Ans. 0.05] 


[Hint: Use the result stated in Example 3 of the text.] 

4. Find the probability of hitting a ship when one fires five torpedoes, each 
with the probability 1/4 of scoring a hit. [Ans. 0.76] 

5. If I hold two aces in a bridge hand, what is the probability that my 
partner holds exactly one ace? [Ans. 0.46] 


6. A man starts at a certain corner and strolls four blocks. Each time he 
comes to a corner, he chooses one of the directions north, south, east, and west 
with equal probability. What are his chances of ending his walk at the corner 
where he started? [Ans. 9/64] 


7. Three players A, B, and C are engaged in a sequence of games in which 
the winner of each game scores one point. The one that first scores three points 
will be the final winner. A wins the first and third games, whereas B wins the 
second. Find the probability that C will be the final winner. [Ans. 2/27] 


8. You are told that six tosses of a coin yielded four heads and two tails. 
Assign a probability to the event: “At the end of each toss, the cumulated 
number of heads exceeded the cumulated number of tails.” [Hint: Enumerate 
the sample space. The answer is 1/3.] 


DISCRETE DISTRIBUTIONS 


In the previous section we developed the idea of a trial resulting 
in a random appearance of one of a set of states E_1, E_2, ..., and 
introduced the notion of sample space as a convenient geometrical model 
for the outcomes. Here we introduce the concept of a random variable 
defined on points of a sample space. A random variable is simply a set 
of numbers x_1, x_2, ..., one for each state, so that the result of a trial is 
not merely the state E_i but also the number of interest x_i. For 
example, if the trial is a toss of two dice, a random variable could be 
defined as the sum of the points shown by the dice. If we denote this 
random variable by x, then x would have the value 2 for the state 
(1, 1), 3 for the state (1, 2), 7 for the state (3, 4), and so on. Note that, 
although there are 36 points in the sample space, there are only 11 
possible values for x, namely the integers 2, 3, 4, ..., 12. The 
event “x = 2” occurs at only one point of the sample space; so the 
probability of the event “x = 2” is 1/36. The event “x = 3” occurs 
at the points (1, 2) and (2, 1) and thus has the probability 2/36. The 
frequency function of the random variable x is the complete listing of 
the values of the random variable, together with their probabilities of 
occurrence. In this example, the frequency function is defined by 
Table 1. 


TABLE 1 

Random variable x    Frequency function P(x) 

 2                    1/36 
 3                    2/36 
 4                    3/36 
 5                    4/36 
 6                    5/36 
 7                    6/36 
 8                    5/36 
 9                    4/36 
10                    3/36 
11                    2/36 
12                    1/36 

Total                36/36 

The sum of the individual probabilities in any frequency function 
is 1; this arises from the fact that one and only one of the various out- 
comes occurs as the result of a trial. 

Many random variables can be defined on the same sample space. 
If the trial is a throw of two dice, we might also define the random 
variable x as the smaller of the number of points showing on the two 
dice (or as the common number, in case of a tie). In this case x would 
have the range 1, 2, 3, 4, 5,6. The event “x = 6” occurs only at the 
sample point (6, 6), and has a probability of 1/36. The event “x = 5” 
occurs at the three points (5, 6), (6, 5), (5, 5) and has a probability of 
3/36; and so on. The complete frequency function for this random 
variable is given in Table 2. 


TABLE 2 

Random variable x    Frequency function P(x) 

1                    11/36 
2                     9/36 
3                     7/36 
4                     5/36 
5                     3/36 
6                     1/36 

Total                36/36 
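
Both frequency functions can be obtained by listing the 36 sample points and counting how many map to each value of the random variable. The sketch below illustrates this; the helper name `frequency_function` is hypothetical.

```python
from collections import Counter
from fractions import Fraction
from itertools import product

def frequency_function(random_variable):
    """Tabulate P(x) for a random variable defined on the 36 two-dice points."""
    points = product(range(1, 7), repeat=2)          # equally likely sample points
    counts = Counter(random_variable(d1, d2) for d1, d2 in points)
    return {x: Fraction(n, 36) for x, n in sorted(counts.items())}

table_1 = frequency_function(lambda d1, d2: d1 + d2)  # sum of the points
table_2 = frequency_function(min)                     # smaller of the two numbers
```

Note that `Fraction` reduces to lowest terms, so 6/36 is stored as 1/6; the values are the same as in Tables 1 and 2.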


Often the frequency function of a random variable is characterized 
by two derived numbers: its mean and its variance. Such a description 
of a frequency function, though admittedly incomplete, is valuable for 
many of the applications. 

Let x_i (i = 1, 2, ..., n) be the various possible values that a random 
variable can assume, and let P(x_i) be the probability that the random 
variable assumes the value x_i. If we denote the mean by \bar{x} and the 
variance by Var (x), then we define 

(2.13)  \bar{x} = \sum_{i=1}^{n} x_i P(x_i) 

(2.14)  Var (x) = \sum_{i=1}^{n} (x_i - \bar{x})^2 P(x_i) 

For the computation of the variance, it is often more convenient to 
use the equivalent formula 

(2.15)  Var (x) = \sum_{i=1}^{n} x_i^2 P(x_i) - (\bar{x})^2 

The mean of x is an estimate of the value of x to be expected from a 
trial, and the variance is a measure of the expected scatter of achieved 
values of x about this expected value. 
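
As an illustration of (2.13) through (2.15), the sketch below computes the mean and variance of the sum of two dice (Table 1) and checks that the definition and the computational formula agree. The function name `mean_and_variance` is hypothetical.

```python
from fractions import Fraction

def mean_and_variance(freq):
    """freq maps each value x_i to P(x_i); uses (2.13) and (2.15)."""
    mean = sum(x * p for x, p in freq.items())
    var = sum(x * x * p for x, p in freq.items()) - mean ** 2
    return mean, var

# Frequency function of Table 1: P(s) = (6 - |s - 7|) / 36 for s = 2, ..., 12.
table_1 = {s: Fraction(6 - abs(s - 7), 36) for s in range(2, 13)}

mean, var = mean_and_variance(table_1)
# The defining formula (2.14), for comparison with (2.15).
var_by_definition = sum((x - mean) ** 2 * p for x, p in table_1.items())
```

For Table 1 this gives a mean of 7 and a variance of 35/6 by either formula.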

Just as in the application of the formulas for the probability of com- 
posite events, the underlying sample space need not be explicit in order 
for the concepts of random variable and probability distribution to be 
used. A good working definition of a random variable is that it is a 
chance number turned up by a trial from among the set of numbers 
(x_1, x_2, ...); the frequency function of the random variable is then a 
set of non-negative numbers 

P(x_1), P(x_2), ... 

one for each x_i, such that 

\sum_i P(x_i) = 1 

With such working definitions, as indeed in any complete treatment of 
sample spaces, the number of possible values of the random variable x 
does not have to be finite. An example of a discrete random variable 
that has an infinite number of potential values is: the number of tosses 
of a single coin until the appearance of the first head. The event “x = 
1” is equivalent to the event “first toss is a head,” and thus has the 
probability 1/2. The event “x = 2” occurs along with the event “first 
toss tails, second toss heads,” and has the probability 1/4, etc. Here 
the range of x is the totality of positive integers 1, 2, 3, ..., and the 
frequency function of x is given by the formula 

P{x = n} = (1/2)^n 

for each positive integer n. We might mention that a random variable 
with an infinite number of potential values must have arisen from a 
sample space with an infinite number of points, since there is at least 
one sample point for each possible value of the random variable. 
However, the probabilities must form a convergent series whose sum is one. 
In the above example, by the well-known formula for the sum of a 
geometric progression, 

\sum_{n=1}^{\infty} P{x = n} = \sum_{n=1}^{\infty} (1/2)^n = 1 


There are a few discrete probability distributions which recur so often 
in the applications that they have been given special names, such as 
“binomial distribution,” “Poisson distribution,” and so on. These are 
listed below with a few of their properties. 


Binomial Distribution (cf. Example 3) 

Range of x: 0, 1, 2, ..., n. 
Frequency function: P(k) = C_{n,k} p^k (1 - p)^{n-k}. 
Mean: \bar{x} = np. 
Variance: Var (x) = np(1 - p). 


Poisson Distribution 

Range of x: 0, 1, 2, 3, ... to \infty. 
Frequency function: P(k) = \frac{\lambda^k e^{-\lambda}}{k!}, where \lambda is a positive parameter. 
Mean: \bar{x} = \lambda. 
Variance: Var (x) = \lambda. 


Uniform Distribution 

Range of x: 0, 1, 2, ..., n. 
Frequency function: P(k) = \frac{1}{n + 1}. 
Mean: \bar{x} = \frac{n}{2}. 
Variance: Var (x) = \frac{n^2}{12} + \frac{n}{6}. 


Geometric Distribution 

Range of x: 0, 1, 2, 3, ... to \infty. 
Frequency function: P(k) = p^k (1 - p), where p is a positive parameter less than 1. 
Mean: \bar{x} = \frac{p}{1 - p}. 
Variance: Var (x) = \frac{p}{(1 - p)^2}. 

Before proceeding to some examples, we mention the concept of a 
cumulative distribution function for a random variable x. If the range 
of possible values for x is (x_1, x_2, ..., x_n), where the x_i are arranged 
in increasing order, then the cumulative distribution is defined, for any 
number y, as the probability of the event {x ≤ y}. If we denote the 
cumulative probabilities by U(y), then it is apparent that if y < x_1, 
U(y) = 0, and if y ≥ x_n, U(y) = 1, while 


(2.16)  U(y) = \sum_{j=1}^{i} P(x_j)    for x_i \leq y < x_{i+1} 


Example 7 

An owner of five overnight cabins is considering buying television 
sets to rent to cabin occupants. He estimates that about half of his 
customers would be willing to rent sets. Finally, he buys three sets. 
Assuming 100% occupancy at all times: 


(a) What fraction of the evenings will there be more requests than 
TV sets? 

(b) What is the probability that a customer who requests a television 
set will receive one? 

(c) If the owner’s cost per set per day is C, what rental R must he 
charge in order to break even in the long run? 


SOLUTION 

(a) The daily number of requests is a random variable, subject to 
the binomial distribution with n = 5 and p = 1/2. We want the 
probability that this random variable assumes the value 4 or 5. This 
probability is C_{5,4}(1/2)^5 + C_{5,5}(1/2)^5 = 3/16 = 0.188. 

(b) Any one of the following set of mutually exclusive events will 
result in a given customer’s request being satisfied: 

1. There are no other requests that night. 

2. There is one other request. 

3. There are two other requests. 

4. There are three other requests, and his precedes at least one of 
them. 

5. There are four other requests, and his precedes at least two of 
them. 

The probabilities of the first three events listed are: 

P_1 = (1/2)^4 
P_2 = C_{4,1}(1/2)^4 
P_3 = C_{4,2}(1/2)^4 

Assuming that his request is randomly situated in time with respect 
to the other requests, the probabilities of events 4 and 5 are: 

P_4 = (3/4) C_{4,3}(1/2)^4 
P_5 = (3/5) C_{4,4}(1/2)^4 

The probability of the desired event is the sum of these five 
probabilities, or 73/80. 

(c) The daily revenue T is a random variable with the following 
distribution: 

T      Probability 

0      (1/2)^5 = 1/32 
R      5(1/2)^5 = 5/32 
2R     10(1/2)^5 = 5/16 
3R     10(1/2)^5 + 5(1/2)^5 + (1/2)^5 = 1/2 

Mean T = \bar{T} = (5/32) R + (5/16) 2R + (1/2) 3R 
       = (73/32) R 

The cost per day is constant at 3C. The break-even rental is the 
value of R for which 

(73/32) R = 3C 

or R = 1.315C 
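
The three answers of Example 7 can be reproduced with exact arithmetic. This is a sketch only; the variable names are hypothetical, and part (b) uses the same assumption as the text, namely that the customer's request is randomly placed in time among the others.

```python
from fractions import Fraction
from math import comb

n_cabins, n_sets, p = 5, 3, Fraction(1, 2)

def binom(n, k, p):
    """Binomial frequency function C_{n,k} p^k (1 - p)^(n - k)."""
    return comb(n, k) * p ** k * (1 - p) ** (n - k)

# (a) Probability of more requests than sets: 4 or 5 requests.
p_shortage = sum(binom(n_cabins, k, p) for k in range(n_sets + 1, n_cabins + 1))

# (b) Condition on the number m of OTHER requests among the remaining 4 cabins;
#     with m + 1 requests in random order, ours is served w.p. min(1, 3/(m + 1)).
p_served = sum(
    binom(n_cabins - 1, m, p) * min(Fraction(1), Fraction(n_sets, m + 1))
    for m in range(n_cabins)
)

# (c) Expected number of sets rented per night, and break-even ratio R / C.
expected_rented = sum(min(k, n_sets) * binom(n_cabins, k, p)
                      for k in range(n_cabins + 1))
break_even_ratio = n_sets / expected_rented
```

Here `break_even_ratio` is the exact fraction 96/73, which rounds to the 1.315 quoted above.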


Example 8 

Prove that for the uniform distribution, defined by 

P(k) = \frac{1}{n + 1}    k = 0, 1, 2, ..., n 
P(k) = 0                  k < 0 or k > n 

the mean is n/2 and the variance is n^2/12 + n/6. 


SOLUTION 

We make use of the following results from elementary algebra: 

1 + 2 + \cdots + r = \frac{1}{2} r(r + 1) 
1^2 + 2^2 + \cdots + r^2 = \frac{1}{6} r(r + 1)(2r + 1) 

Thus 

\bar{x} = \sum_{i=0}^{n} i \cdot \frac{1}{n + 1} = \frac{1}{n + 1} \sum_{i=0}^{n} i = \frac{1}{n + 1} \cdot \frac{n(n + 1)}{2} = \frac{n}{2} 

Var (x) = \sum_{i=0}^{n} i^2 \cdot \frac{1}{n + 1} - (\bar{x})^2 
        = \frac{1}{n + 1} \sum_{i=0}^{n} i^2 - \frac{n^2}{4} 
        = \frac{1}{n + 1} \cdot \frac{n(n + 1)(2n + 1)}{6} - \frac{n^2}{4} 
        = \frac{n^2}{12} + \frac{n}{6} 

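
The result of Example 8 can be spot-checked for particular values of n by summing directly. The sketch below is a numerical check, not a proof; the function name `uniform_mean_var` is hypothetical.

```python
from fractions import Fraction

def uniform_mean_var(n):
    """Mean and variance of the uniform distribution on 0, 1, ..., n by direct summation."""
    p = Fraction(1, n + 1)
    mean = sum(k * p for k in range(n + 1))
    var = sum(k * k * p for k in range(n + 1)) - mean ** 2
    return mean, var

# Compare with the closed forms n/2 and n^2/12 + n/6 for a few values of n.
checks = {
    n: uniform_mean_var(n) == (Fraction(n, 2), Fraction(n * n, 12) + Fraction(n, 6))
    for n in range(1, 21)
}
```
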
Example 9 


A manufacturing process involves the processing of a part by two 
machines, machine A and machine B, in that order. The part is subject 
to breakage in each machine; and, when a part breaks, the machine is 
out of service temporarily for repairs. If the probability of breakage 
of a part is p_1 in machine A and p_2 in machine B, find the expected 
number of parts that will be completely processed before the first 
breakdown. 


SOLUTION 

We shall make use of the following elementary algebraic result: 

y + 2y^2 + 3y^3 + \cdots = \frac{y}{(1 - y)^2}    if -1 < y < 1 

Let x denote the number of finished pieces at the time the process 
stops. 
The random variable x will take the value 0 if either of these two 
mutually exclusive events occurs: 

1. The first part breaks in machine A. 
2. The first part goes through machine A, but breaks in machine B. 

Then 

P{x = 0} = p_1 + (1 - p_1) p_2 = p_1 + p_2 - p_1 p_2 

For convenience, set P_0 = p_1 + p_2 - p_1 p_2. Clearly P_0 is also the 
probability that any started part will not go through successfully. 

The random variable x will take the value k if both the following 
events occur: 

1. The first k parts go through successfully. 
2. The (k + 1)st part breaks. 

Thus 

P{x = k} = (1 - P_0)^k P_0 

The mean \bar{x} is obtained, as usual, by 

\bar{x} = \sum_{k=0}^{\infty} k P{x = k} 
        = \sum_{k=0}^{\infty} k P_0 (1 - P_0)^k = P_0 \sum_{k=1}^{\infty} k (1 - P_0)^k 
        = P_0 \frac{1 - P_0}{P_0^2} = \frac{1 - P_0}{P_0}    by the above algebraic identity 

Thus 

\bar{x} = \frac{1}{p_1 + p_2 - p_1 p_2} - 1 

(This value for the mean can also be obtained by noting that x follows 
a geometric distribution with parameter 1 - P_0.) 
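
A quick numerical check of Example 9 compares a long partial sum of the series for the mean with the closed form 1/P_0 - 1. The breakage probabilities used below (0.1 and 0.2) are illustrative assumptions, and the function name `expected_good_parts` is hypothetical.

```python
def expected_good_parts(p1, p2, terms=10_000):
    """Return (partial sum of the mean series, closed-form mean) for Example 9."""
    p0 = p1 + p2 - p1 * p2                      # a started part fails to get through
    series = sum(k * (1 - p0) ** k * p0 for k in range(terms))
    closed_form = 1 / p0 - 1                    # equals (1 - P_0) / P_0
    return series, closed_form

series, closed_form = expected_good_parts(0.1, 0.2)
```

With these assumed values P_0 = 0.28, so roughly 2.57 parts are finished on average before the first breakdown.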


Up to this point we have considered only situations where a trial 
results in a value for one random variable. In some of the applications, 
however, a trial can involve two or even more random variables. An 
example is a trial consisting of the selection of a man from a population 
and the measurement of his height and weight; another is the selection 
of a card from a deck and noting its suit and its value. We shall deal 
specifically only with the bivariate case of two random variables; the 
extension of the ideas to more than two random variables involves 
only some expansion of the notation. 

If x and y take on only discrete values, with x ranging over the set 
(x_1, x_2, ..., x_n) and y over the set (y_1, y_2, ..., y_s), then the result of 
a “trial” will be one of the ns points (x_i, y_j). The probability 
distribution of the result of a trial is completely specified by the ns 
individual probabilities associated with the ns events {x = x_i; y = y_j}. 
In other words, the joint frequency function of x and y is given by 

P(x_i, y_j) = a_{ij};    i = 1, ..., n;    j = 1, ..., s 

The values of n, s, and the individual a_{ij} are what serve to distinguish 
one joint probability distribution from another. For any such joint 
distribution, \sum_i \sum_j a_{ij} = 1. 


Even in the bivariate case, it is meaningful to speak of the “frequency 
function of x”; this is defined conceptually by simply suppressing the 
value of the second variable in the result of a trial. It can be shown 
that, in the bivariate case, 

P{x = x_i} = \sum_{j=1}^{s} P(x_i, y_j)    for each i 

and similarly that 

P{y = y_j} = \sum_{i=1}^{n} P(x_i, y_j)    for each j 


Example 10 

Two dice are tossed four times. Find the joint distribution for x, 
the number of times a 7 is obtained, and y, the number of times an 
11 is obtained. 


SOLUTION 

Here a single trial consists of tossing the dice four times. One 
approach to the problem might be to regard the sample space as 
consisting of the 11^4 points (2, 2, 2, 2), (2, 2, 2, 3), (2, 2, 2, 4), ..., 
(12, 12, 12, 12), with the corresponding individual probabilities 
computed by the methods described in the preceding section. Then the 
joint probability of x and y could be obtained as follows: 


(x, y)     P(x, y) 

(0, 0)     Sum of probabilities for those points with 
           no 7’s and no 11’s 

(0, 1)     Sum of probabilities for those points with 
           no 7’s and one 11 

(0, 2)     Sum of probabilities for those points with 
           no 7’s and two 11’s 


and similarly for the remaining possible combinations (0, 3), (0, 4), 
(1, 0), (1, 1), (1, 2), (1, 3), (2, 0), (2, 1), (2, 2), (3, 0), (3, 1), (4, 0). 
This procedure is lengthy, and so we proceed as below. 


Let i and j be two integers between 0 and 4 whose sum does not 
exceed 4. By the multiplication formula (2.4), 

P{x = i; y = j} = P{x = i} P{y = j | x = i} 

From earlier work we know that the probability of a 7 in a toss of 
two dice is 1/6. Since P{x = i} is just the probability of i 7’s in four 
tosses, 

P{x = i} = C_{4,i} (1/6)^i (5/6)^{4-i} 

Now P{y = j | x = i} is the probability of j 11’s in the remaining 4 - i 
tosses. The unconditional probability of 11 in a toss of two dice is 
1/18; but the conditional probability of this event, given that the 
toss is not a 7, is 1/18 divided by 5/6, or 1/15. Then 

P{y = j | x = i} = C_{4-i,j} (1/15)^j (14/15)^{4-i-j} 

Thus the joint frequency function for x and y is 

P{x = i; y = j} = C_{4,i} C_{4-i,j} (1/6)^i (5/6)^{4-i} (1/15)^j (14/15)^{4-i-j} 

This may be rewritten as 

P{x = i; y = j} = \frac{4!}{i! \, j! \, (4 - i - j)!} (1/6)^i (1/18)^j (7/9)^{4-i-j} 
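
The two forms of the joint frequency function can be checked against each other and against a direct enumeration. The sketch below shortens the enumeration by classifying each toss only as a 7, an 11, or anything else, which is an assumption of this illustration rather than the 11^4-point sample space described above; all function names are hypothetical.

```python
from fractions import Fraction
from itertools import product
from math import comb, factorial

P7, P11 = Fraction(1, 6), Fraction(1, 18)
P_OTHER = 1 - P7 - P11                      # 7/9

def joint_by_conditioning(i, j):
    """P{x = i} P{y = j | x = i}, as in the multiplication formula."""
    p_x = comb(4, i) * P7 ** i * (1 - P7) ** (4 - i)
    q = P11 / (1 - P7)                      # P(11 | not a 7) = 1/15
    return p_x * comb(4 - i, j) * q ** j * (1 - q) ** (4 - i - j)

def joint_rewritten(i, j):
    """The rewritten (multinomial) form of the joint frequency function."""
    k = 4 - i - j
    coef = factorial(4) // (factorial(i) * factorial(j) * factorial(k))
    return coef * P7 ** i * P11 ** j * P_OTHER ** k

def joint_by_enumeration():
    """Sum probabilities over all 3^4 sequences of toss categories."""
    probs = {"7": P7, "11": P11, "other": P_OTHER}
    table = {}
    for seq in product(probs, repeat=4):
        pr = Fraction(1)
        for outcome in seq:
            pr *= probs[outcome]
        key = (seq.count("7"), seq.count("11"))
        table[key] = table.get(key, 0) + pr
    return table

enumerated = joint_by_enumeration()
```
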


CONTINUOUS DISTRIBUTIONS 


In the preceding section, random variables were discussed whose 
possible values were separated or “discrete,” such as the number of 
heads in n tosses of a coin, the number of good apples in a barrel, and 
so on. Here we turn to random variables which are, more properly 
and conveniently, regarded as ranging over a continuum of values. 
Examples are the weight of a carload of coal, the time between clicks 
of a Geiger counter, the temperature at noon in Miami. 

Continuous random variables are defined on sample spaces with a 
non-denumerable infinity of points. Because of the mathematical 
difficulties involved, we shall not develop the concept in detail, but 
retain the notion of a “trial” as a set of operations which result in some 
single definite (but random) value for x. Then, if a is some particular 
value on the range of x, the event “the value of x resulting from a 
trial is less than or equal to a” has a definite probability in the sense 
of this chapter. We call this probability P{x ≤ a}. Inasmuch as 
P{x ≤ a} is defined for each value a on the range of x, it is actually a 
function of a, which for brevity we shall denote by U(a): 


U(a) = P{x ≤ a} 


The function U(a) corresponding to a continuous random variable x 
is usually called the “cumulative distribution function of x.” Some 
mathematicians prefer the briefer term “distribution function of x,” 
although “cumulative distribution function” is frequently found in 
statistical textbooks. 

If a and b are two values on the range of x with a < b, we may 
denote by P{a < x ≤ b} the probability of the event “the value of x 
resulting from a trial is more than a but not more than b.” We can 
express P{a < x ≤ b} in terms of the cumulative distribution function 
by first noting that 


P{x ≤ b} = P{x ≤ a} + P{a < x ≤ b}    see (2.2) 

Then 

P{a < x ≤ b} = U(b) - U(a) 


In this section we will be concerned only with cumulative distribution 
functions U(a) that possess derivatives f(a) at all points on the range 
of x. In such cases f(a) = U'(a), and we have 


(2.17)  U(a) = \int_{-\infty}^{a} f(x) dx;    P{a < x \leq b} = \int_{a}^{b} f(x) dx 


The function f(x), defined when the cumulative distribution function 
is differentiable, is called the “probability density function of x,” or 
more briefly the “density function of x.” 

Some immediate consequences, for any random variable x with 
density function f(x), are 


(a) \int_{-\infty}^{\infty} f(x) dx = 1. 

(b) If a is a fixed value of x, P{x = a} = 0. 

(c) f(x) dx can be regarded as the probability that the random 
variable takes on a value between x and x + dx. 

As in the discrete case, the probability distribution of a random 
variable which ranges over a continuum may be described 
(incompletely, of course) by its mean \bar{x} and variance Var (x): 


(2.18)   \bar{x} = \int_{-\infty}^{\infty} x f(x) dx 

(2.19)   Var (x) = \int_{-\infty}^{\infty} (x - \bar{x})^2 f(x) dx    definition 

(2.19')  Var (x) = \int_{-\infty}^{\infty} x^2 f(x) dx - (\bar{x})^2    in computation 


Several familiar examples of probability distributions with x ranging 
over a continuum are given below, along with the mean and variance 
for each. 


Rectangular Distribution (Figure 2.5) 

Range of x: 0 to a. 
Density: f(x) = \frac{1}{a}. 
Mean: \bar{x} = \frac{a}{2}. 
Variance: Var (x) = \frac{a^2}{12}. 

[Figure 2.5. Rectangular distribution.] 


Normal Distribution (Figure 2.6) 

Range of x: -\infty to \infty. 
Density: f(x) = \frac{1}{\sigma \sqrt{2\pi}} \exp\left[-\frac{(x - m)^2}{2\sigma^2}\right]. 
Mean: \bar{x} = m. 
Variance: Var (x) = \sigma^2. 

[Figure 2.6. Normal distribution.] 


Exponential Distribution (Figure 2.7) 

Range of x: 0 to \infty. 
Density: f(x) = a e^{-ax}, where a is a positive parameter. 
Mean: \bar{x} = \frac{1}{a}. 
Variance: Var (x) = \frac{1}{a^2}. 

[Figure 2.7. Exponential distribution.] 


Gamma Distribution (Figure 2.8) 

Range of x: 0 to \infty. 
Density: f(x) = \frac{b^{a+1} x^a e^{-bx}}{\Gamma(a + 1)}, where b is a positive 
parameter, and a is a parameter greater than -1. [Here 
\Gamma(v) denotes the gamma function with argument v.] 
Mean: \bar{x} = \frac{a + 1}{b}. 
Variance: Var (x) = \frac{a + 1}{b^2}. 

[Figure 2.8. Gamma distribution.] 


Beta Distribution (Figure 2.9) 

Range of x: 0 to 1. 
Density: f(x) = \frac{(a + b + 1)!}{a! \, b!} x^a (1 - x)^b, where a and b are 
both parameters greater than -1. 
Mean: \bar{x} = \frac{a + 1}{a + b + 2}. 
Variance: Var (x) = \frac{(a + 1)(b + 1)}{(a + b + 3)(a + b + 2)^2}. 

[Figure 2.9. Beta distribution.] 

(This type of distribution often occurs when the random variable x 
is a proportion.) 


If x is a random variable, then any function of x is also a random 
variable. For instance, the value obtained by squaring the value of a 
random variable x is a new random variable z, defined by means of 
the equation z = x^2. If f(x) is the density function for x, the density 
function g(z) for the new random variable z = z(x) may be obtained 
in two ways: 


(1) by use of the relation 

f(x) dx = g(z) dz 

(2) by use of the relation 

\int f(x) dx = \int g(z) dz 

Example 11 * 


A hardware store wishes to order Christmas tree lights for sale 
during the Christmas season. On the basis of past experience, they 
feel that the demand v for lights can be approximately described by 
the density function f(v); i.e. f(v) dv is the probability that the demand 
will lie between v and v + dv. On each light ordered and sold, they 
make a profit of a cents; on each light ordered but not sold, they sustain 
a loss of b cents. Determine the number of lights they should order, 
to maximize expected profit. 

* This example is a modification of a problem discussed in “Notes from MIT 
Summer Course in Operations Research,” Massachusetts Institute of Technology, 
1953. 


SOLUTION 

Let x = number of lights ordered. 
The actual profit R is a random variable, which is really a function 
of both the random variable v and the decision variable x. 
Symbolically, the expected profit E(R) for a particular value of x 
is given by 

E(R) = \int_{0}^{\infty} R(v, x) f(v) dv 

The function R(v, x), by the conditions of the problem, has the form 

R(v, x) = av - b(x - v)    for v ≤ x 
R(v, x) = ax               for v > x 

Then 

E(R) = \int_{0}^{x} [av - b(x - v)] f(v) dv + \int_{x}^{\infty} ax f(v) dv 

We maximize E(R) by differentiating with respect to x and setting the 
derivative equal to zero. Note that the differentiation problem here is 
one of differentiating integrals with respect to a parameter. Applying 
equation (II.1) of Appendix II, we obtain as the condition for x 

0 = -b \int_{0}^{x} f(v) dv + a \int_{x}^{\infty} f(v) dv 

If we let 

U(x) = \int_{0}^{x} f(v) dv 

the above equation may be rewritten as 

-b U(x) + a[1 - U(x)] = 0 

or 

U(x) = \frac{a}{a + b} 

The optimal order, then, is the solution x of the equation 

(2.20)  \int_{0}^{x} f(v) dv = \frac{a}{a + b} 

The graphical solution of this equation is suggested in Figure 2.10. 

[Figure 2.10. Graphical solution of equation (2.20).] 
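
Equation (2.20) can also be solved numerically once a demand distribution is chosen. The sketch below uses bisection on U(x) = a/(a + b). The exponential demand with mean 500 and the values a = 10, b = 4 are illustrative assumptions, not data from the example, and the function name `optimal_order` is hypothetical.

```python
import math

def optimal_order(cdf, a, b, lo=0.0, hi=1e6, tol=1e-9):
    """Solve U(x) = a / (a + b) by bisection; cdf must be nondecreasing on [lo, hi]."""
    target = a / (a + b)
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if cdf(mid) < target:
            lo = mid          # too few lights: U(mid) is still below the target
        else:
            hi = mid
    return (lo + hi) / 2

# Assumed demand: exponential with mean 500, so U(x) = 1 - exp(-x / 500).
mean_demand = 500.0
exp_cdf = lambda x: 1.0 - math.exp(-x / mean_demand)

x_star = optimal_order(exp_cdf, a=10.0, b=4.0)
```

For this assumed distribution the equation can be solved by hand, giving x = 500 ln((a + b)/b), which the bisection result matches.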


Example 12 

A random variable x has range a to b and density f(x). What are 
the range and density function g(z) of the random variable z, if the 
value of z is obtained from the value of x by means of 


(a) z = x - a? 
(b) z = kx? 


SOLUTION 

(a) The event {z between z_0 and z_0 + dz} occurs when and only when 
the event {x between z_0 + a and z_0 + a + dz} occurs. Thus 

g(z_0) dz = f(z_0 + a) dz 

Canceling dz, and dropping the subscript, 

g(z) = f(z + a)    0 ≤ z ≤ b - a 


(b) The event {z between z_0 and z_0 + dz} occurs when and only when 
the event {x between z_0/k and z_0/k + dz/k} occurs. Then 

g(z_0) dz = f(z_0/k)(dz/k) 

or 

g(z) = (1/k) f(z/k)    ka ≤ z ≤ kb 

Example 13 

An extrusion plant is engaged in making up an order for pieces 20 
feet long. Inasmuch as the extrusion process creates unusable material 
at both ends, somewhat more than 20 feet must be extruded. If more 
than 20 feet of good metal results, the piece is salable; if less than 20 
feet of good metal is left, the piece is scrapped. Assume that the ex- 
trusion length can be controlled exactly, that the total end scrap per 
piece is a random variable y with density f(y), and that the extrusion 
cost is C dollars per foot. Determine the optimum extrusion length 
if the quantity to be minimized is the extrusion cost per salable piece. 


SoLuTION 
Let 20 + x = extrusion length. The extrusion cost is 

C(20 + x) 

and the probability that a piece will be salable is the same as 

P{y ≤ x} = \int_{0}^{x} f(y) dy 

Thus the extrusion cost per salable piece is 

\frac{C(20 + x)}{\int_{0}^{x} f(y) dy} 

This has the derivative (with respect to x) equal to 

C \frac{\int_{0}^{x} f(y) dy - (x + 20) f(x)}{\left[\int_{0}^{x} f(y) dy\right]^2} 

Thus the condition that must be satisfied by the optimal x is 

(2.21)  (x + 20) f(x) = \int_{0}^{x} f(y) dy 

This equation for x is interpreted graphically in Figure 2.11. 

[Figure 2.11. Graphical solution of equation (2.21): the curves z = (x + 20) f(x) and z = U(x) = \int_{0}^{x} f(y) dy plotted against x.] 
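
Condition (2.21) can be solved numerically for a particular scrap distribution. The sketch below assumes, purely for illustration, that the end scrap y is exponentially distributed with mean 1 foot; for that density U(x) - (x + 20) f(x) changes sign once, so bisection applies. The names `optimal_extra_length` and `cost_per_salable` are hypothetical.

```python
import math

def optimal_extra_length(pdf, cdf, base=20.0, lo=1e-9, hi=100.0, tol=1e-10):
    """Bisection on g(x) = U(x) - (x + base) f(x), assuming one sign change on [lo, hi]."""
    g = lambda x: cdf(x) - (x + base) * pdf(x)
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if g(mid) < 0:
            lo = mid          # the cost per salable piece is still decreasing here
        else:
            hi = mid
    return (lo + hi) / 2

# Assumed scrap density: exponential with mean m = 1 foot.
m = 1.0
pdf = lambda y: math.exp(-y / m) / m
cdf = lambda y: 1.0 - math.exp(-y / m)

x_opt = optimal_extra_length(pdf, cdf)

def cost_per_salable(x, C=1.0):
    """Extrusion cost per salable piece, C (20 + x) / U(x)."""
    return C * (20.0 + x) / cdf(x)
```

Under this assumed density the optimum is a little over 3 feet of extra length, and the cost per salable piece is higher on either side of it.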


If two random variables x and y both range over a continuum, the 
joint distribution of x and y is described by assigning probabilities to 
each of the events {(x, y) lies in R}, where R is an arbitrary region of 
the plane. That is, for any region of the xy plane, there corresponds 
a definite probability that a “trial” will yield a point within the region. 
We shall describe these two-dimensional continuous distributions by 
means of a joint density function. 

[Figure 2.12. The region R (joint probability).] 

The two random variables x and y are said to have the joint density 
function f(x, y) if, for every choice of a region R in the plane, 


P{(x, y) lies in R} = \iint_{R} f(x, y) dx dy 


Immediate consequences of the definition are: 


(a) \int_{-\infty}^{\infty} \int_{-\infty}^{\infty} f(x, y) dx dy = 1. 

(b) If x_0 and y_0 are two fixed numbers chosen before the “trial,” 
P{x = x_0; y = y_0} = 0. 

(c) f(x, y) dx dy is the probability that the point resulting from a 
“trial” falls within the elementary rectangle with vertices (x, y), 
(x + dx, y), (x, y + dy), (x + dx, y + dy). 

If we are given the joint density function f(x, y) for x and y, the 
one-dimensional density function for either x or y alone may be 
obtained simply by integrating out the other variable. Letting g(x) 
denote the density function for x, and h(y) the density function for y, 
the explicit formulas are 


g(x) = \int_{-\infty}^{\infty} f(x, y) dy 

h(y) = \int_{-\infty}^{\infty} f(x, y) dx 

Example 14 

Given that x and y are two random variables with the joint density 
function f(x, y), find the distribution of x + y and x - y. 


SOLUTION 

Let u = x + y. The probability of the event {u < x + y < u + du}, 
for fixed u, is just the probability that the random point will fall 
in the shaded area of Figure 2.13. 

[Figure 2.13. Region of integration in Example 14.] 

Thus 

f(u) du = \iint f(x, y) dx dy    taken over the shaded area 
        = \int_{-\infty}^{\infty} \int_{u-y}^{u-y+du} f(x, y) dx dy 
        = du \int_{-\infty}^{\infty} f(u - y, y) dy 

Canceling out the du on both sides, we have, for the density of u = 
x + y, 

f(u) = \int_{-\infty}^{\infty} f(u - y, y) dy 

Similarly, we find, for the density function of u = x - y, 

f(u) = \int_{-\infty}^{\infty} f(u + y, y) dy 

PROBLEMS FOR SOLUTION 
Set II 


1. Give the (discrete) probability distribution for the sum of two consecutive 
digits in a random number table. (In such a table, the digit occupying any 
particular position is a random variable ranging over the integers 0 to 9 with 
probability 1/10 for each.) 




2. Derive the formulas given on page 23 for the mean and variance of 
a random variable whose distribution is (a) binomial, (b) Poisson. 

3. In a sequence of throws with two dice, what is the probability that the 
first 5 will precede the first 7? 

4. Verify the formulas given on pages 31, 32 for the mean and variance of 
(a) the rectangular distribution, (b) the exponential distribution, (c) the gamma 
distribution. 

5. Prove that the two parameters which appear in the definition of the 
normal distribution are interpretable as the mean of x and the standard devi- 
ation (square root of the variance) of z. 

6. Given that x is distributed rectangularly between 0 and 3. Find the 
distribution of 

(a) 2x. 

(b) x + 2. 

(c) x. 

7. The density function for the life (operating hours till failure) of a certain 
type of television tube is 

f(x) = 0         x < 80 
f(x) = 80/x^2    x ≥ 80 
You buy a new set containing three of these tubes. What is 

(a) the probability that all three tubes will still be functioning after 120 
hours? [Ans. 8/27] 

(b) the density function of the time till the first tube failure? 

8. The useful life u of a light bulb may be regarded as a random variable 
with density function h(u). If the bulb is observed to be burning at time T, 
what is the probability that it is still burning at time T + t? 
CHAPTER 


Sampling 


Most operations research studies are concerned, to a greater or 
lesser degree, with random variables whose means and distributions 
cannot be known precisely. Though one often assumes a particular 
value for the mean, in order to test the effect of a change in some 
controllable variable, this contributes nothing to an understanding of 
process and is never a strong point in the study. The role of sampling 
theory in operations research is to systematize, so far as is possible, 
the assumptions that must be made concerning the means and distri- 
butions of relevant random variables. 

Inasmuch as the distribution itself is beyond the reach of direct 
measurement, one must make inferences on the basis of information 
supplied by “data.” The “data” may be regarded as direct measure- 
ments of the corresponding individuals in the population which is 
described by the inaccessible distribution. Such inferences can be de- 
fended only if the data are “representative”: i.e. if the corresponding 
individuals can be regarded as having been drawn without bias from 
the population as a whole. As examples of the false inferences which 
might be made on the basis of a non-representative sample, we might 
mention: an estimate of the average age of people in a city based on 
a sample of 100 interviews at the Old Folks Home; an estimate of 
average income based on a random sample drawn from a telephone 
directory, especially in such countries as India; a literacy survey 
conducted entirely by mail. 

The theory of sampling provides guides to the selection of a sample
that will be representative, as well as an estimate of the degree of 
belief in the inferences made on the basis of a representative sample 
of a given size. In a typical situation, the inference is an estimate of 
the mean of the theoretical distribution, and the degree of belief in 
this estimate is expressed as the root-mean-square difference between 
the estimated mean and the true mean. 

This chapter provides a brief introduction to sampling theory. Some 
attention is also given to the problem of fitting a theoretical distribution 
to a histogram, and to simulated sampling by the Monte Carlo method. 


TYPES OF PHYSICAL SAMPLING 


In the simplest type of sampling, we draw a sample of size n in such
a way as to ensure that each of the N members of the population has
the same chance of being included in the sample. A sample selected
in this way is called a simple random sample. One way of drawing a 
simple random sample is to number every individual (if the population 
is finite), put the numbers on slips of paper, and draw lots. A more 
convenient method is to use a table of random digits. Such tables 
contain lists of digits so chosen that each digit between 0 and 9 has an 
equal chance of appearing at a given spot in a single column, and each 
two-digit number between 00 and 99 has the same chance of appearing 
at a given spot in a double column, and so on. 


Example 1

Using a random number table, show how to draw a simple random
sample of ten different individuals from a population with 632 members.


SOLUTION

We number the individuals from 000 to 631, and proceed to read
down some three-digit column in a random number table:

    346 ✓        686 ✗
    248 ✓        905 ✗
    232 ✓        358 ✓
    383 ✓        221 ✓
    640 ✗        507 ✓
    366 ✓        232 ✗
    353 ✓        137 ✓

We continue to write down three-digit numbers until we have ten
different numbers lying between 000 and 631, rejecting any number above
631 and any repeat (such as the second 232). Here our simple random
sample includes the members whose tags read: 346, 248, 232, 383, 366,
353, 358, 221, 507, 137.

This procedure obviously guarantees that each number between 000 
and 631 has the same chance of being included in the sample. 
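
The table-reading procedure of Example 1 can be sketched in a few lines of Python. This is only an illustration: the pseudo-random generator stands in for the printed table of random digits, and the function name and the seed are assumptions made here, not part of the text.

```python
import random

def draw_simple_random_sample(population_size, sample_size, rng):
    """Read random numbers of fixed width, rejecting out-of-range values and repeats."""
    # Width of the column to read: three digits for a population of 632.
    upper = 10 ** len(str(population_size - 1))
    chosen = []
    while len(chosen) < sample_size:
        tag = rng.randrange(upper)      # one entry read from the "table"
        if tag >= population_size:      # like 640, 686, 905 in Example 1
            continue
        if tag in chosen:               # like the second 232 in Example 1
            continue
        chosen.append(tag)
    return chosen

rng = random.Random(1)                  # arbitrary seed, for repeatability only
sample = draw_simple_random_sample(632, 10, rng)
print(sample)
```

Because rejected entries are simply skipped, every admissible tag keeps the same chance of being the next one accepted, which is the property claimed for the procedure.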


Another useful type of sampling procedure is called stratified random 
sampling. The members of the population are first assigned to strata 
or groups, on the basis of some characteristic, and a simple random 
sample is drawn from each stratum. The individuals in all the samples 
taken together constitute the sample from the population as a whole.

    Stratum    Number in stratum    Number in sample
       1             $N_1$                $n_1$
       2             $N_2$                $n_2$
       3             $N_3$                $n_3$
      ...             ...                  ...
       k             $N_k$                $n_k$
     Total            $N$                  $n$

If the component sample sizes $n_1, n_2, \ldots, n_k$ are so chosen that

    $\frac{n_1}{N_1} = \frac{n_2}{N_2} = \cdots = \frac{n_k}{N_k} = \frac{n}{N}$


we have what is known as proportional stratified random sampling. 
Other important types of sampling procedure, not discussed here, 
are systematic sampling and cluster sampling. 


ESTIMATES FROM SAMPLES 
AND THEIR PRECISION 


Our purpose in sampling is to draw inferences about the parent 
population; we are usually concerned with estimating parameters of 
the parent population from the sample. Probably the simplest and 
most commonly required parameters are the mean and variance, but 
many others are possible. In order to judge how useful our sampling 
procedure is, we examine how our estimates would vary if we repeatedly 
used the same procedure. If the average value of the estimate over all
possible samples is the population value, the method of estimation is
said to be unbiased. If two procedures are both unbiased, we can
compare them by considering the variance (over all possible samples) 
of their estimates. 

In general, we choose a sampling procedure which yields unbiased 
estimates with a variance as low as possible. There may, however, 
be a few occasions when the cost of avoiding bias is greater than the 
added value of a bias-free estimate; in such cases we could probably 
tolerate a small amount of bias. 

In what follows we will be thinking of a population (x) of N members,
with a mean μ and variance Var (x). Generally sampling will be without
replacement; i.e., no member can appear in a particular sample
more than once.

If we draw a simple random sample $(x_1, x_2, \ldots, x_n)$ from such a
population, it can be shown that the sample mean, $\bar{x} = (1/n) \sum x_i$, is
an unbiased estimate of the population mean. Thus, if $E(\bar{x})$ denotes
the average or “expected” value of $\bar{x}$ over all possible samples, we have

(3.1)    $E(\bar{x}) = \mu$

On the other hand, the sample variance, defined as
$\frac{1}{n} \sum_{i=1}^{n} (x_i - \bar{x})^2$, is
not an unbiased estimate of Var (x). In fact, its expected value is
[(n − 1)/n] Var (x); we can get an unbiased estimate by multiplying
by n/(n − 1). That is,

    $\frac{1}{n-1} \sum_{i=1}^{n} (x_i - \bar{x})^2$

is an unbiased estimate of Var (x).
The sample mean $\bar{x}$ is a random variable whose mean, according to
(3.1), is $\mu$. The variance of the sample mean is given (exactly) by

(3.2)    $\mathrm{Var}(\bar{x}) = \left(1 - \frac{n}{N}\right) \frac{\mathrm{Var}(x)}{n}$

The term (1 − n/N) is called the finite multiplier. Some such term can
be intuitively justified because, if we sampled the entire population to
estimate the mean $\mu$, we would obtain $\mu$ every time, and the estimate
would have no variance. In this case n = N, and (3.2) yields zero for
$\mathrm{Var}(\bar{x})$. But, when N tends to infinity, we get the simpler form

(3.3)    $\mathrm{Var}(\bar{x}) = \frac{\mathrm{Var}(x)}{n}$

This result is also true for a finite population when we sample with
replacement: i.e., when there is no restriction on the number of times 
an individual can appear in a particular sample. 

In practice, we may be able to use an approximate value of Var (x),
obtained from previously collected data, to obtain $\mathrm{Var}(\bar{x})$ in advance
of our sample; or we may use our sample estimate of Var (x) substituted
in (3.2) to obtain an estimate of $\mathrm{Var}(\bar{x})$.

In most practical cases the distribution of $\bar{x}$ over many samples can
be taken to be normal, provided n is reasonably large (say over 20)
and n/N is small.
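
For a very small population, the statements about unbiasedness and the finite multiplier can be checked by listing every possible sample. The sketch below does this in Python for six hypothetical values; it assumes that the population variance is taken with the divisor N − 1, which is the convention under which the finite-multiplier formula holds exactly.

```python
from itertools import combinations
from statistics import mean

population = [2, 3, 5, 8, 12, 15]      # hypothetical values, N = 6
N, n = len(population), 3
mu = mean(population)
# Population variance with divisor N - 1 (assumed convention, see lead-in).
S2 = sum((x - mu) ** 2 for x in population) / (N - 1)

samples = list(combinations(population, n))   # all samples without replacement
sample_means = [mean(s) for s in samples]
sample_vars = [sum((x - mean(s)) ** 2 for x in s) / (n - 1) for s in samples]

avg_of_means = mean(sample_means)                        # should equal mu
var_of_means = mean([(m - mu) ** 2 for m in sample_means])
predicted = (1 - n / N) * S2 / n                         # finite-multiplier formula
avg_of_vars = mean(sample_vars)                          # should equal S2

print(avg_of_means, mu)
print(var_of_means, predicted)
print(avg_of_vars, S2)
```

Each printed pair agrees to rounding error, so averaging over all twenty samples reproduces the population mean, the variance of the sample mean, and the population variance.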


Example 2 

A preliminary series of observations yields the following weights
(in pounds) for 20 schoolboys selected at random from a given age
group. How many observations would be required if the estimate of
the average weight of boys in this age group is to have a 95% chance of
being correct to ±1/2 pound?


W838 79.5 72.6 
75.3 80.9 68.1 
82.4 67.5 70.0 
80.1 71.6 75.0 
82.0 81.9 82.0 


eo JID 
He bo Oo Ore 


SOLUTION

We first compute an unbiased estimate of the variance of an individual
observation. We obtain

    $\bar{x} = 75.8, \quad \sum (x - \bar{x})^2 = 579.51, \quad \frac{1}{19} \sum (x - \bar{x})^2 = 30.51$

Hence an unbiased estimate of the variance is 30.51; and the estimated
standard deviation is the square root of this, or 5.52.

We now assume that weights of all boys in the age group are distributed
with a standard deviation of 5.52 pounds, but with an unknown
mean. A sample of n boys will have a mean that is normally distributed
about the mean of all boys in the age group and that has a standard
deviation of

    $\frac{5.52}{\sqrt{n}}$

From tables we find that 95% of a normal population lies within
plus or minus twice the standard deviation of the mean. Since we
want a 95% chance of our sample mean lying within half a pound of
the true mean, we require

    $\frac{2 \times 5.52}{\sqrt{n}} = \frac{1}{2}$

or n = 488.
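
The arithmetic of Example 2 can be written as a small Python function. The function name is hypothetical; the multiplier z = 2 is the rounded value used in the text, and the second call with z = 1.96 is added here only to show how much the answer depends on that rounding.

```python
import math

def required_sample_size(sigma, half_width, z=2.0):
    """Smallest n for which z * sigma / sqrt(n) does not exceed half_width."""
    return math.ceil((z * sigma / half_width) ** 2)

n_text = required_sample_size(5.52, 0.5)           # z = 2, as in the text
n_sharper = required_sample_size(5.52, 0.5, 1.96)  # more exact 95% multiplier
print(n_text, n_sharper)
```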
Example 3 


Suppose that we have an infinite population of individuals, some of
whom possess a certain attribute while the rest do not. Let P be the
proportion of individuals in the population that have the attribute.
We take a simple random sample of size n, of which m are found to have
the attribute. Show that

(3.4)    $\hat{P} = \frac{m}{n}$

is an unbiased estimate of P, and also that

(3.5)    $\frac{\hat{P}(1 - \hat{P})}{n - 1}$

is an unbiased estimate of the variance of this estimate.


SOLUTION

The random variable m, by hypothesis, follows the binomial distribution
for the number of successes in n trials with success probability P:

    $P(m) = \binom{n}{m} P^m (1 - P)^{n-m}$

We know that the expected value E(m) of m may be obtained from the
formula E(m) = nP. Since n is a constant, we have

    $E(\hat{P}) = E\left(\frac{m}{n}\right) = \frac{1}{n} E(m) = \frac{1}{n} \cdot nP = P$

This proves that $\hat{P}$ gives an unbiased estimate of P.
The exact value of $\mathrm{Var}(\hat{P})$ is given by

    $\mathrm{Var}(\hat{P}) = \mathrm{Var}\left(\frac{m}{n}\right) = E\left[\frac{m}{n} - E\left(\frac{m}{n}\right)\right]^2$

    $= \frac{1}{n^2} E\{m - E(m)\}^2$    since n is a constant

    $= \frac{1}{n^2} \mathrm{Var}(m) = \frac{nP(1-P)}{n^2} = \frac{P(1-P)}{n}$

Now

    $E\left[\frac{\hat{P}(1-\hat{P})}{n-1}\right] = \frac{E(\hat{P}) - E(\hat{P}^2)}{n-1}$

    $= \frac{P - \mathrm{Var}(\hat{P}) - [E(\hat{P})]^2}{n-1}$

    $= \frac{P(1-P)}{n-1} - \frac{P(1-P)}{n(n-1)}$

    $= \frac{P(1-P)}{n} = \mathrm{Var}(\hat{P})$

Hence (3.5) gives an unbiased estimate of $\mathrm{Var}(\hat{P})$.
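
The two unbiasedness claims of Example 3 can also be confirmed by direct summation over the binomial distribution. The Python sketch below uses exact rational arithmetic; the values n = 12 and P = 3/10 are illustrative choices made here, not taken from the text.

```python
from fractions import Fraction
from math import comb

def expectation(n, P, g):
    """E[g(m)] for m following the binomial distribution with n trials, probability P."""
    return sum(comb(n, m) * P**m * (1 - P)**(n - m) * g(m) for m in range(n + 1))

n, P = 12, Fraction(3, 10)              # illustrative values only

# Expected value of the estimate m/n.
mean_of_estimate = expectation(n, P, lambda m: Fraction(m, n))

# Expected value of the variance estimate (3.5).
mean_of_var_estimate = expectation(
    n, P, lambda m: Fraction(m, n) * (1 - Fraction(m, n)) / (n - 1))

true_variance = P * (1 - P) / n
print(mean_of_estimate, mean_of_var_estimate, true_variance)
```

With fractions the agreement is exact, not merely approximate, for any n ≥ 2 and any rational P one cares to try.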


Example 4 

We try to predict the result of a mayoralty election in one of our 
large cities. Our poll includes interviews with 1000 randomly selected 
registered voters. Results of the poll are: 


For Candidate A: 532. 
For Candidate B: 468. 


On the basis of this information, we publish the prediction of A’s 
election. Election Day comes, and B wins 51.1% of the votes. How 
unlucky were we? 


SOLUTION

In the language of Example 3, the attribute in question is “intention
to vote for A.” The proportion P of the population possessing the
attribute is 0.489; and the estimate $\hat{P}$ from a sample of 1000 is 0.532.
The estimate $\hat{P}$ is approximately normally distributed with a mean
of 0.489. The variance of $\hat{P}$ need not be estimated in this example,
since we know the underlying P; we have as an exact result:

    $\mathrm{Var}(\hat{P}) = \frac{P(1-P)}{1000} = \frac{(0.511)(0.489)}{1000} = 2.499 \times 10^{-4}$

    $\mathrm{S.D.}(\hat{P}) = 0.0158$

Our observed result 0.532 differs from the expected result 0.489 by
about 2.7 standard deviations. The probability of this occurring by
chance is less than one in a hundred. We conclude that we were not
at all unlucky; rather, either our “simple random” sample was simply
not random, or a sizable proportion of the voters changed their minds
between our poll and the election.
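
The size of the discrepancy in Example 4 can be checked numerically. The following Python lines are a sketch that relies on the normal approximation already assumed in the solution; the variable names are chosen here for illustration.

```python
import math

P_true, n, p_hat = 0.489, 1000, 0.532

sd = math.sqrt(P_true * (1 - P_true) / n)        # S.D. of the estimate
z = (p_hat - P_true) / sd                        # discrepancy in S.D. units
upper_tail = 0.5 * math.erfc(z / math.sqrt(2))   # P(Z >= z), standard normal

print(round(sd, 4), round(z, 2), round(upper_tail, 4))
```

Under these assumptions the one-sided tail probability is a fraction of one per cent, and doubling it for a two-sided statement still leaves it below one in a hundred.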


We next give estimates of the mean and the corresponding variances
for stratified random samples. Recall that the population is subdivided
into k strata, of sizes $N_1, N_2, \ldots, N_k$ with $\sum N_h = N$. A simple random
sample of size $n_1$ is taken from stratum 1, of size $n_2$ from stratum 2,
etc. with $\sum n_h = n$. Let $\mu$ be the true mean of the entire population,
let $\mu_h$ be the true mean of the hth stratum, and let $\bar{x}_h$ be the observed
mean of the sample of $n_h$ drawn from the hth stratum. Then it is not
hard to show that an unbiased estimate of the true mean $\mu$ is given by

(3.6)    $\hat{\mu} = \frac{1}{N} \sum_{h=1}^{k} N_h \bar{x}_h$

That is, the estimate of the population mean is the weighted mean of
the observed subsample means, where the weight applied to the subsample
mean $\bar{x}_h$ is $N_h/N$.

Let $x_h$ be the random variable associated with measurements on
individuals in the hth stratum. With this notation, the variance of
the estimate (3.6) for the population mean $\mu$ is given by *

(3.7)    $\mathrm{Var}(\hat{\mu}) = \sum_{h=1}^{k} \left(\frac{N_h}{N}\right)^2 \frac{N_h - n_h}{N_h} \cdot \frac{\mathrm{Var}(x_h)}{n_h}$


In the special case where the population consists entirely of one stratum,
we have k = 1, $N_h = N$, $n_h = n$, and the expression for $\mathrm{Var}(\hat{\mu})$
becomes, as in (3.2),

(3.8)    $\mathrm{Var}(\hat{\mu}) = \left(1 - \frac{n}{N}\right) \frac{\mathrm{Var}(x)}{n}$


If the sampling procedure is to draw a proportional stratified sample,
we have

    $\frac{N_h}{N} = \frac{n_h}{n}$

and (3.6) reduces to

(3.9)    $\hat{\mu} = \frac{\sum n_h \bar{x}_h}{n} = \frac{1}{n}$ [sum of all the values in the sample]

For proportional sampling, equation (3.7) simplifies to

(3.10)    $\mathrm{Var}(\hat{\mu}) = \left(1 - \frac{n}{N}\right) \frac{1}{n} \sum_{h=1}^{k} \frac{N_h}{N} \mathrm{Var}(x_h)$

* For the derivation of formula (3.7), see W. G. Cochran, Sampling Techniques,
p. 69, John Wiley & Sons, New York, 1953.


Example 5 

Suppose that the population is divided into k strata with the notation
as above, and that the size of the total sample is fixed at n. Determine
the optimal sizes $n_1, n_2, \ldots$ of the subsamples from the various strata,
if the criterion for optimality is minimum variance for the estimate of $\mu$.


SOLUTION

We start with equation (3.7) for $\mathrm{Var}(\hat{\mu})$. We can ignore the constant
factor $1/N^2$, and hence seek to minimize

    $\sum_{h=1}^{k} N_h (N_h - n_h) \frac{\mathrm{Var}(x_h)}{n_h}$

subject to the constraint

    $\sum_{h=1}^{k} n_h = n$

We set up the Lagrangian function

    $f(n_1, n_2, \ldots, n_k, \lambda) = \sum N_h (N_h - n_h) \frac{\mathrm{Var}(x_h)}{n_h} + \lambda \left(\sum n_h - n\right)$

For a minimum, we equate the partial derivatives with respect to
$n_1, \ldots, n_k$ and $\lambda$ to zero:

    $\frac{\partial f}{\partial n_h} = -\frac{N_h^2 \,\mathrm{Var}(x_h)}{n_h^2} + \lambda = 0$

    $\frac{\partial f}{\partial \lambda} = \sum n_h - n = 0$

Hence

    $n_h = \frac{N_h \sqrt{\mathrm{Var}(x_h)}}{\sqrt{\lambda}} = \frac{N_h \sigma_h}{\sqrt{\lambda}}$    if we set $\sqrt{\mathrm{Var}(x_h)} = \sigma_h$

From the second equation above, $\sqrt{\lambda}$ has the value

    $\sqrt{\lambda} = \frac{1}{n} \sum_{h=1}^{k} N_h \sigma_h$

so that we obtain as our final result

(3.11)    $n_h = n \cdot \frac{N_h \sigma_h}{\sum N_h \sigma_h}$

In words, this result says that, for optimal allocation of a sample
of given size to strata, the subsample from a stratum should be proportional
to the product of the stratum size and the stratum standard
deviation.
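
The allocation rule just derived is easy to try out numerically. The Python sketch below allocates a sample in proportion to stratum size times stratum standard deviation and compares the variance of the stratified estimate, computed from the variance formula for stratified samples given earlier, with that of a proportional allocation. The two-stratum population and both function names are hypothetical, and the subsample sizes are left unrounded.

```python
import math

def optimal_allocation(stratum_sizes, stratum_variances, n):
    """Subsample sizes proportional to N_h * sigma_h, summing to n (unrounded)."""
    weights = [N_h * math.sqrt(v) for N_h, v in zip(stratum_sizes, stratum_variances)]
    total = sum(weights)
    return [n * w / total for w in weights]

def variance_of_estimate(stratum_sizes, stratum_variances, sample_sizes):
    """Sum over strata of (N_h/N)^2 * (N_h - n_h)/N_h * Var(x_h)/n_h."""
    N = sum(stratum_sizes)
    return sum((N_h / N) ** 2 * (N_h - n_h) / N_h * v / n_h
               for N_h, v, n_h in zip(stratum_sizes, stratum_variances, sample_sizes))

# Hypothetical population: two strata with very different variances.
sizes, variances, n = [600, 400], [4.0, 36.0], 100

optimal = optimal_allocation(sizes, variances, n)
proportional = [n * N_h / sum(sizes) for N_h in sizes]

var_optimal = variance_of_estimate(sizes, variances, optimal)
var_proportional = variance_of_estimate(sizes, variances, proportional)
print(optimal, var_optimal)
print(proportional, var_proportional)
```

In this made-up case the optimal rule shifts observations toward the smaller but more variable stratum and lowers the variance of the estimate by about a quarter.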


Example 6 

A department store wishes to ascertain the average amount out- 
standing on its customer accounts. Past experience suggests that the 
sums outstanding are grouped as shown in Table 1. 


TABLE 1

                                                    Variance of amount
Stratum    Range              Relative frequency    outstanding

   1       < 5.00                   0.20                    1
   2       5.01- 20.00              0.40                    6
   3       20.01- 50.00             0.25                   20
   4       50.01-100.00             0.10                   75
   5       100.01-250.00            0.03                  650
   6       > 250.00                 0.02                 2500


Show how to use stratified sampling to minimize the variance of the 
estimate of the mean amount outstanding for a sample consisting of 
5000 accounts. 


SOLUTION

For convenience, we rewrite equation (3.11) as follows:

    $n_h = n \cdot \frac{(N_h/N)\,\sigma_h}{\sum (N_h/N)\,\sigma_h}$

The computations are arranged in Table 2.


TABLE 2

Stratum    $\mathrm{Var}(x_h)$    $\sigma_h$    $N_h/N$    $(N_h/N)\sigma_h$    $n_h$

   1             1           1         0.20         0.20          202
   2             6           2.45      0.40         0.98          992
   3            20           4.47      0.25         1.12         1133
   4            75           8.66      0.10         0.87          881
   5           650          25.50      0.03         0.77          779
   6          2500          50         0.02         1.00         1012

Totals           -           -         1.00         4.94         5000


It may well happen that the store does not have enough accounts in
some strata to take samples of the size indicated in the last column.
Suppose, for example, there are 40,000 accounts in all. Then only 800
of them are for amounts in excess of $250 (stratum 6). In this case,
we would make a complete check of accounts exceeding $250 and reallocate
the remainder of the sample (4200) to the other strata in
proportion to the n_h above.

The final results would be:

    Stratum            1        2        3       4       5      6
    Sample size      213    1,045    1,194     927     821    800
    Stratum size   8,000   16,000   10,000   4,000   1,200    800

We compute the variance of the estimated mean from this sample,
using equation (3.7):

    $\mathrm{Var}(\hat{\mu}) = (0.20)^2 \cdot \frac{7787}{8000} \cdot \frac{1}{213} + (0.40)^2 \cdot \frac{14{,}955}{16{,}000} \cdot \frac{6}{1045}$

        $+ (0.25)^2 \cdot \frac{8806}{10{,}000} \cdot \frac{20}{1194} + (0.10)^2 \cdot \frac{3073}{4000} \cdot \frac{75}{927}$

        $+ (0.03)^2 \cdot \frac{379}{1200} \cdot \frac{650}{821} + (0.02)^2 \cdot \frac{0}{800} \cdot \frac{2500}{800}$
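
The sum above can be evaluated with a few lines of Python. This is a sketch of the arithmetic only: the stratum sizes, variances and final sample sizes are those listed in Example 6, while the numerical total printed by the code is computed here and is not quoted from the text.

```python
# (stratum size N_h, variance of amount outstanding, final sample size n_h)
strata = [
    (8000, 1, 213),
    (16000, 6, 1045),
    (10000, 20, 1194),
    (4000, 75, 927),
    (1200, 650, 821),
    (800, 2500, 800),    # stratum 6 is checked completely
]

N = sum(N_h for N_h, _, _ in strata)

# One term of the stratified-variance formula per stratum.
terms = [(N_h / N) ** 2 * (N_h - n_h) / N_h * var / n_h
         for N_h, var, n_h in strata]
variance = sum(terms)

for h, t in enumerate(terms, start=1):
    print(h, round(t, 6))
print("variance of estimated mean:", round(variance, 5))
```

The last term vanishes because stratum 6 is sampled completely, and the total comes to roughly 0.0028.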


Example 7 

For the population of accounts in Example 6, make a rough estimate 
of the variance of the sample mean, if the sampling procedure involves 
taking a simple random sample consisting of 5000 accounts from the 
entire population. 


SOLUTION

Remembering that $\mu$ denotes the mean of the population, and $\mu_h$
the mean of the hth stratum, and introducing the notation $x_{hi}$ for the
value of the ith account in the hth stratum, we have (exactly)

    $N \,\mathrm{Var}(x) = \sum_h \sum_i (x_{hi} - \mu)^2$    by definition of the variance

    $= \sum_h \sum_i (x_{hi} - \mu_h + \mu_h - \mu)^2$

    $= \sum_h \sum_i (x_{hi} - \mu_h)^2 + \sum_h \sum_i (\mu_h - \mu)^2$
        + a cross-product term which sums to zero

    $= \sum_h N_h \,\mathrm{Var}(x_h) + \sum_h N_h (\mu_h - \mu)^2$

This result may be rewritten:

    $\mathrm{Var}(x) = \sum_h \frac{N_h}{N} \left[\mathrm{Var}(x_h) + (\mu_h - \mu)^2\right]$


To apply this result to the problem at hand, we would have to know
the stratum means and the population mean, all of which are unavailable.
For a rough estimate, we shall assume that the stratum
means lie in the middle of the strata, for the first five strata, and
“guess” that the mean of the last stratum is $400. Then the population
mean μ is the weighted mean of these stratum means. We have:

    $\mu_1 = 2.50$
    $\mu_2 = 12.50$
    $\mu_3 = 35.00$
    $\mu_4 = 75.00$
    $\mu_5 = 175.00$
    $\mu_6 = 400.00$

    $\mu = \sum_h \frac{N_h}{N} \mu_h = 36.00$

Using these values, and the figures for Var (x_h) given in the statement
of Example 6, we compute

    $\sum_h \frac{N_h}{N} \mathrm{Var}(x_h) = 84.6$

    $\sum_h \frac{N_h}{N} (\mu_h - \mu)^2 = 3827.1$

    $\mathrm{Var}(x) = 3911.7$


Hence the estimate of the mean account size, if based on a simple 
random sample of size 5000, will have the variance 3911.7/5000 = 
0.782. Comparing with the result of Example 6, it is seen that there 
is a tremendous advantage in this situation in using stratified sampling 
rather than simple sampling. The difference is more dramatic here
than in most situations, because the complete sampling of stratum 6
dictated by the optimal stratified procedure eliminated this stratum’s 
contribution to the variance. But, even in the absence of such fortu- 
nate circumstances, the precision of the estimate of the mean can 
usually be improved appreciably through judicious stratification. 
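
The rough estimate of Example 7 can be reproduced in Python. The sketch splits the population variance into a within-stratum part and a between-stratum part, using the relative frequencies and variances of Table 1 and the assumed stratum means (mid-points, with a guessed 400 for the last stratum). Carried out directly, the arithmetic gives a weighted mean of 35.0 and a between-stratum part near 3826, marginally different from the printed intermediate figures; the final variance of about 0.78 is unaffected.

```python
weights   = [0.20, 0.40, 0.25, 0.10, 0.03, 0.02]        # N_h / N from Table 1
variances = [1, 6, 20, 75, 650, 2500]                    # Var(x_h) from Table 1
means     = [2.50, 12.50, 35.00, 75.00, 175.00, 400.00]  # assumed stratum means

mu = sum(w * m for w, m in zip(weights, means))

within = sum(w * v for w, v in zip(weights, variances))            # within strata
between = sum(w * (m - mu) ** 2 for w, m in zip(weights, means))   # between strata
total = within + between

var_simple_mean = total / 5000     # simple random sample of 5000 accounts
print(round(mu, 2), round(within, 1), round(between, 1), round(var_simple_mean, 3))
```

Nearly all of the variance comes from the spread between stratum means, which is exactly the part that stratification removes from the estimate.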


PROBLEMS FOR SOLUTION 
Set I 


1. Use a table of random numbers to obtain a set of 25 three-digit random 
numbers. Using these as page numbers in your telephone directory, copy 
down the last three digits of the first number on each page. Find the mean 
and variance of these 25 three-digit numbers. Are your results compatible 
with the assumption that the last three digits of a telephone number are 
uniformly distributed between 000 and 999? 


2. An experienced gambler alters his dice so that, in his opinion, the prob- 
ability of a 7 exceeds 0.30. Before using this equipment, he decides to test 
it by rolling the dice 100 times and noting the results. How many 7’s must 
he obtain in order to be 99% sure that the probability of a 7 is at least 0.30? 

[Ans. 41 sevens]


3. A manufacturer wishes to produce electric fuses with no more than 1% 
defective. He checks quality every so often by taking a sample of 10 fuses 
from the line; if one or more of the 10 are defective, the manufacturing process 
is halted, and a search is made for an assignable cause. 


(a) How often will he fail to halt production when defects are running at
2%? [Ans. 82% of the time]
(b) How often will he needlessly halt production when defects are running
at 0.75%? [Ans. 7% of the time]


4. One method of sampling for a proportion is to fix in advance not the
sample size n, but the number of observed successes m. If m is fixed, and if 
P is the relative frequency of the attribute in the population being sampled, 
determine 


(a) the probability distribution of the sample size n 
(b) the expected sample size. [Ans. m/P] 


5. A machine produces parts which average 1% defective; however, the 
percentage defective varies from batch to batch. What size batch would 
insure that only one batch in a hundred would contain more than 2% de- 
fectives? [Ans. n > 543] 


6. We wish to determine the percentage of boys born in a certain com- 
munity. It is known that the odds on a given birth being a boy are about 
1 in 2. How many birth records should be examined so that there is a 95% 
chance that the percentage is accurate to within ±1%? [Ans. 9604]


7. A sample of five bearings is taken from an automatic grinder, and their
diameters are measured. The results are:

    1.003
    1.001
    0.996
    0.997
    0.998

The process is considered to be running satisfactorily if 99% of the output
lies in the range 1 ± 0.005. On the basis of this sample, decide whether the
grinder needs adjustment. [Ans. Yes]


8. A simple random sample of 16 households was drawn from a city area
containing 9420 households. The numbers of persons per household in the
sample were as follows:

    5  2  4  7
    6  4  4  4
    3  4  3  3
    3  5  2  5

(a) Estimate the total number of people in the area. [Ans. 37,680]
(b) Approximate the probability that this estimate is within ±10% of the
true value. [Ans. About 0.75]


9. We wish to estimate the value of farm products in the United States 
by means of a sample of 1000 farms.* Three different methods of stratification 
are available to us: by size of farm, type of farm, and type of operator. Using 
the information shown in Table 3, page 54, and assuming proportional sampling, 
determine which of the three methods of stratification is most effective. 
Compare the variance of the resulting sample with that of a simple random
sample of 1000 farms.
[Ans. Stratification by farm size yields a sampling variance of
40.21 for the estimated mean. This is lower than the variance
for the other methods of stratification.]


* This example is taken from Morris H. Hansen, William N. Hurwitz, and
William G. Madow, Sample Survey Methods and Theory, Vol. 1, John Wiley & Sons,
New York, 1953.


TABLE 3

Stratum                          Number of farms, N_h    Variance, est.

A. Size of farm, acres
   Under 10                            593,937               17,910
   10-29                               944,379               12,709
   30-49                               707,544               13,891
   50-69                               472,598               15,179
   70-99                               685,146               16,631
   100-139                             634,611               24,821
   140-179                             566,248               33,118
   180-219                             283,091               51,362
   220-259                             210,058               63,914
   260-499                             473,923               97,223
   500-999                             173,547              180,767
   1000 and over                       113,807              394,343
   All farms                         5,858,889               47,393

B. Type of farm
   Not classified by type              106,929                1,125
   Fruit and nut                       132,873              222,300
   Vegetable                            93,646              143,303
   Horticultural-specialty              14,841              424,325
   All other crops                   1,860,644               41,571
   Dairy                               558,667               48,739
   Poultry                             273,129               69,594
   Livestock                           809,817               90,392
   Forest products                      30,645                8,807
   General farms                       688,807               24,502
   Subsistence                       1,288,891                  146
   All farms                         5,858,889               47,393

C. Type of operator
   Full owners                       3,292,063               35,683
   Part owners                         661,156              106,153
   Managers                             47,357              509,610
   Tenants, cash                       410,091               36,205
   Tenants, share-cash                 137,380               48,164
   Tenants, share                      682,561               40,052
   Tenants, croppers                   446,850                2,696
   Other and unknown                   181,481               29,488
   All farms                         5,858,889               47,393


CURVE FITTING


It is frequently necessary to fit an algebraic function to the observed
relationship between two or more variables. Thus, for example, we
observe a series of pairs $(x_i, y_i)$ [i = 1, 2, ..., n], and we wish to obtain
y as a function of x. In order to do this, we must first guess the form
of the function in terms of one or more parameters and then estimate
the parameters so as to obtain a good fit. We are quite free to decide
what we mean by a “good fit.” Frequently we agree to mean that the
sum of the squares of the errors shall be a minimum. If our formula
gives $\hat{y}_i = f(a, b, \ldots, k; x_i)$, then we choose a, b, ..., k so as to
minimize

    $\sum_i (y_i - \hat{y}_i)^2$

The simplest example is when f = ax + b and we minimize

    $\sum_i (y_i - a x_i - b)^2$


The details are well known; they may be found in any standard 
statistics text, and will not be repeated here. 

A second problem arises when we have a series of observations
$x_i$ [i = 1, ..., n], and wish to estimate the probability distribution
function that fits them best. Again we are free to decide what is meant
by a good fit, and here it is more convenient to agree that the best fit
is that which makes as many as possible of the observed moments equal
to the corresponding moments of the fitted distribution. Let
p(a, b, ..., k; x) be the distribution we wish to fit, and let m = m(a, b, ..., k)
be the mean of this distribution. Then, if $\bar{x}$ is the mean of the observations,
we determine the parameters a, b, ..., k from the simultaneous
equations

    $\bar{x} = m(a, b, \ldots, k)$    first moment (mean)

    $\frac{1}{n} \sum_{i=1}^{n} (x_i - \bar{x})^2 = \int_{-\infty}^{\infty} (x - m)^2 \, p(a, b, \ldots, k; x)\,dx$    second moment (variance)

    $\frac{1}{n} \sum_{i=1}^{n} (x_i - \bar{x})^3 = \int_{-\infty}^{\infty} (x - m)^3 \, p(a, b, \ldots, k; x)\,dx$    third moment

and so on, where we use as many equations as there are parameters.
Note that the method requires advance specification of the function
p(a, b, ..., k; x); it then enables us to find the parameters a, b, ..., k.
We can often guess at the form of p from past experience or, if we have 
enough data, from the shape of the histogram. 
The method works equally well when the theoretical distribution is 
discrete. We need only replace the integrals by summations. 


Example 8 
Fit a normal curve to the following series of observations. 


15 17 16 5 20 13 11 9 15 21 15 6 5


Sis) 10) dOU98: 1b Ay 13°20 11 17 21 17 
16 13 12 14 13 14 13 11 8 16 15 12 15 
95 15 11 14 19 12 10 84 20 13 10 9 
23 13 12 11 14 19 13 14 6 11 11 15 18


SOLUTION

The normal curve has only two parameters, the mean and standard 
deviation. Once these are known, the distribution is completely de- 
fined. Thus we compute the mean and standard deviation of the ob- 
served data, and we have the required normal curve. 

The reader can verify that the mean is 13.54 and the standard de- 
viation is 3.67. 

The probability density function for the normal distribution with
mean m and standard deviation $\sigma$ is

    $\frac{1}{\sigma\sqrt{2\pi}} \exp\left[-\frac{(x - m)^2}{2\sigma^2}\right]$

We can use standard tables of the normal distribution to obtain
properties of our fitted distribution, remembering that these tables
give ordinates and areas for the variable

    $u = \frac{x - m}{\sigma} = \frac{x - 13.54}{3.67}$


Example 9 

The gamma distribution, described in Chapter 2, page 32, often 
furnishes a good fit for observations that are intrinsically non-negative 
and of fairly wide range. Among other applications, it has proved 
useful in the analysis of weekly sales data, and in connection with 
certain inventory models. 

Suppose that a metal fabricating company has kept track, over a
period of 80 weeks, of its weekly consumption x of a particular kind of
alloy, and that the results are as follows:

        x         Number of weeks

      0- 99              3
    100-199              3
    200-299              6
    300-399             14
    400-499             13
    500-599             15
    600-699             11
    700-799              9
    800-899              2
    900-999              4
      Total             80


Fit a gamma distribution to these data. 


SOLUTION

The gamma distribution has the form (apart from a constant factor)
$x^a e^{-bx}$, with mean (a + 1)/b and variance $(a + 1)/b^2$. The observed
distribution of x has mean 506 and variance 45,200, as the reader may
verify. Using the method of moments, we determine the parameters
a and b from the equations

    $\frac{a + 1}{b} = 506$

    $\frac{a + 1}{b^2} = 45{,}200$

obtaining a = 4.57, b = 0.011.

[Figure 3.1. Fitting a gamma distribution to the data of Example 9. Horizontal
axis: weekly consumption, 0 to 1000; vertical axis: number of weeks.]
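
The two moment equations of Example 9 have a closed-form solution, sketched below in Python. Dividing the first equation by the second gives b directly, and a then follows from the first. The function name is hypothetical. Solved without intermediate rounding, the equations give a close to 4.66; the value 4.57 quoted in the text is what results when b is first rounded to 0.011.

```python
def fit_gamma_by_moments(mean, variance):
    """Solve (a + 1)/b = mean and (a + 1)/b**2 = variance for a and b."""
    b = mean / variance        # ratio of the two moment equations
    a = mean * b - 1           # from the first equation
    return a, b

a, b = fit_gamma_by_moments(506, 45_200)
a_rounded_b = 506 * round(b, 3) - 1     # a obtained after rounding b to 0.011

print(round(a, 2), round(b, 4), round(a_rounded_b, 2))
```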


Example 10

An owner of five overnight cabins is wondering whether he should
build a sixth cabin. His records over the past 100 days show the following
occupancy figures:

    Nightly occupancy    Number of evenings

           0                     1
           1                     3
           2                     8
           3                    14
           4                    18
           5                    56

Estimate the fraction of evenings that a sixth cabin would be occupied,
if built.


SOLUTION

Inasmuch as no records are available for the number of customers
turned away, we must settle for the following demand history:

    Nightly demand D    Number of evenings n[D]    p(D)

          0                       1                0.01
          1                       3                0.03
          2                       8                0.08
          3                      14                0.14
          4                      18                0.18
    5, 6, 7, 8, ...              56                 -

The next step is to fit some theoretical discrete distribution to the
observed relative frequencies p(0), ..., p(4). The Poisson is a natural
first choice to consider. Examining a table of the Poisson distribution,
specifically those columns with the mean λ = 2, 3, 4, 5, 6, we find that
the choice λ = 5 yields an excellent fit, viz.:

          D         Observed p(D)    Theoretical p(D)

          0             0.01              0.007
          1             0.03              0.034
          2             0.08              0.084
          3             0.14              0.140
          4             0.18              0.175
          5              -                0.175
    6, 7, 8, ...         -                0.385

All six cabins will be occupied if the nightly demand exceeds 5.
Thus the fraction of evenings with 100% occupancy would be approximately
0.38.
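
The Poisson column of Example 10 can be recomputed without tables. The short Python sketch below evaluates the Poisson probabilities for a mean of 5, prints them beside the observed relative frequencies, and sums the tail for demands of six or more; the variable names are chosen here for illustration.

```python
import math

def poisson_pmf(k, lam):
    """Probability of exactly k events for a Poisson distribution with mean lam."""
    return math.exp(-lam) * lam ** k / math.factorial(k)

lam = 5
observed = {0: 0.01, 1: 0.03, 2: 0.08, 3: 0.14, 4: 0.18}

for d, p_obs in observed.items():
    print(d, p_obs, round(poisson_pmf(d, lam), 3))

# A sixth cabin is occupied whenever demand is 6 or more.
prob_sixth_cabin_used = 1 - sum(poisson_pmf(d, lam) for d in range(6))
print(round(prob_sixth_cabin_used, 3))
```

The tail probability agrees with the tabulated figure to within rounding of the individual entries.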


SIMULATED SAMPLING (MONTE CARLO) 


Situations frequently arise where some method of sampling is in- 
dicated, but where the actual taking of a physical sample is either 
impossible or too expensive. In such situations, useful information can 
often be obtained from some type of simulated sampling. Typically,
simulated sampling involves replacing the actual universe of items by
its theoretical counterpart, a universe described by some assumed 
probability distribution, and then sampling from this theoretical popu- 
lation by means of a random number table. The methods of taking 
such a sample, as well as the discussion of decision problems which 
rely heavily on such sampling methods, are often referred to by the 
catch-all label of Monte Carlo Methods. 

To draw an item at random from a universe described by the prob- 
ability density f(x), one proceeds as follows: 


(a) Plot the cumulative probability function *

    $y = F(x) = \int_{-\infty}^{x} f(u)\,du$

(b) Choose a random decimal between 0 and 1 (to as many places as
desired) by means of a table of random digits.

(c) Project horizontally the point on the y axis corresponding to
this random decimal, until the projection line intersects the curve
y = F(x).

(d) Write down the value of x corresponding to the point of intersection.
This value of x is taken as the sample value of x.

[Figure 3.2. Monte Carlo sampling: “Drawing an item” from a population with
cumulative distribution function F(x). The random decimal is marked on the
y axis, and the sample x is read from the x axis.]


* The method would work equally well if one plotted the complement 1 − F(x).


In order to justify this procedure, we must show that any item in
the population has the same chance of being “measured” as any other
item. Equivalently, we want to show that the probability of ending
up with a measurement between $x_1$ and $x_1 + dx$ is proportional to the
density $f(x_1)$. Referring to the left-hand diagram of Figure 3.3, we see
that

    $P\{x_1 < \text{resulting } x < x_1 + dx\} = dy_1$

and, from the right-hand diagram, we have

    $dy_1 = f(x_1)\,dx$

Thus

    $P\{x_1 < \text{resulting } x < x_1 + dx\} = f(x_1)\,dx$

as desired.
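
When the cumulative function can be inverted algebraically, the graphical projection is replaced by a formula. The Python sketch below illustrates this for an exponential distribution, chosen here purely as an example because its cumulative function F(x) = 1 − exp(−x/θ) is easy to invert; the mean θ = 2, the seed and the function name are assumptions, and the pseudo-random generator stands in for the table of random digits.

```python
import math
import random

def sample_exponential(mean_life, rng):
    """Pick y uniformly in [0, 1) and solve y = 1 - exp(-x / mean_life) for x."""
    y = rng.random()                          # the "random decimal"
    return -mean_life * math.log(1.0 - y)     # x at which F(x) equals y

rng = random.Random(0)                        # arbitrary seed
draws = [sample_exponential(2.0, rng) for _ in range(20_000)]

sample_mean = sum(draws) / len(draws)
fraction_below_1 = sum(x < 1.0 for x in draws) / len(draws)

print(round(sample_mean, 3))                  # should be near the mean, 2
print(round(fraction_below_1, 3), round(1 - math.exp(-0.5), 3))
```

The fraction of draws below any chosen point settles near the value of F at that point, which is the content of the justification above.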
The procedure need not be carried through in detail for either the
rectangular or the normal distribution, since tables of random deviates
are available for both distributions. For the rectangular distribution,
we use tables of random digits, and read a random decimal from an
arbitrary spot in the table. This yields a number between 0 and 1.
If the distribution in question is rectangular between 12 and 15, say,
we multiply the random decimal by 3 and add 12 to the result to
obtain our sample value. For the normal distribution, we use tables
of random normal deviates, obtained from a population with mean
zero and standard deviation unity. The corresponding sample value
is obtained by multiplying the random normal deviate by σ, and
adding μ.

[Figure 3.3. Diagram used in justification of Monte Carlo sampling routine.
Left-hand diagram: the curve y = F(x), with the increment dy_1 corresponding
to the interval from x_1 to x_1 + dx. Right-hand diagram: the strip over the
same interval, with dy_1 = F'(x_1) dx = f(x_1) dx.]


Example 11 

The range of a sample is the difference between the largest and 
smallest observations in the sample. Estimate the mean range in 
samples of five drawn from a normal distribution.


SOLUTION

We read off a sample of five from the random normal number tables
and obtain, e.g.:

     1.119
    −0.792
     0.063
     0.484
     1.045

This sample has range 1.911. Continuing in this way, we obtain ranges 
of 1.334, 2.633, 3.089, 3.844, 3.013, 3.825, 1.073, 2.488, 1.292, 2.390, 
2.796, 2.841, 2.274, 2.827, 1.633, 1.112, 1.730, 3.353, 3.783, 1,433, 2.828, 
3.883, 2.621, 2.250. These 25 separate observations of the range have 
a mean of 2.490. 

Thus our estimate of the mean range in samples of five from a normal
distribution with standard deviation $\sigma$ (and any $\mu$) is $2.490\sigma$. This
compares with the exact value for the mean range * in samples of
five of $2.326\sigma$.
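The same experiment is easily repeated with many more samples. The sketch below is an illustration only: it assumes a pseudo-random generator in place of the table of random normal deviates, and the function name and the choice of 20,000 samples are our own.

```python
import random

def mean_range(sample_size, n_samples, seed=0):
    """Estimate the mean range of samples drawn from a standard normal population."""
    rng = random.Random(seed)
    total = 0.0
    for _ in range(n_samples):
        sample = [rng.gauss(0.0, 1.0) for _ in range(sample_size)]
        total += max(sample) - min(sample)   # range of this sample
    return total / n_samples

# The first sample of the text, as a check on the definition of the range.
book_sample = [1.119, -0.792, 0.063, 0.484, 1.045]
first_range = max(book_sample) - min(book_sample)   # 1.911

# Many more samples than the 25 of the text (sample count is an assumption).
estimate = mean_range(sample_size=5, n_samples=20000, seed=0)
```

With a sample count of this size the estimate should lie much closer to the exact value $2.326\sigma$ than the 25-sample figure of 2.490, which illustrates how slowly Monte Carlo estimates settle down.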


Example 12 

A bombing mission is sent to bomb an important factory, which is 
rectangular in shape and has the dimensions 250 by 500 feet. The 
bombers will drop 10 bombs altogether, from high altitude, all aimed 
at the geometric center of the plant. We assume that the bombing 
run is made parallel to the long dimension of the plant, that the de- 
viation of the impact point from the aiming point is normal with mean 
zero and standard deviation 200 feet in each dimension, and that these 
two deviations are independent random variables. Use Monte Carlo 
sampling to estimate the expected number of bomb hits, and compare 
your result with the exact value. 


SOLUTION

Let x be the horizontal deviation, and y be the vertical deviation,
as shown in the diagram. A hit will result if both the following events
occur:

−250 < x < 250

−125 < y < 125

Otherwise the bomb will be a miss.
*See A. Hald, Statistical Tables and Formulas, p. 60, John Wiley & Sons, New
York, 1952.

Figure 3.4. Flight line and target area in Example 12. [Labels: flight line, impact point, 250 ft, 500 ft.]


If we set

$$u = \frac{x}{200}, \qquad v = \frac{y}{200}$$

so that $u$ and $v$ will be the corresponding deviates read from a random
normal number table, then the condition for a hit becomes

$$-1.250 < u < 1.250$$
$$-0.625 < v < 0.625$$


Results of the first three trials are given in Table 4. 

These three trials yield 3.33 as the average number of hits per 
mission. Many more trials should be conducted before one can have 
any real confidence in the result. One way of estimating how many 
trials are necessary is to list the cumulated mean at the end of each trial, 
and to stop the trials when the mean seems to have settled down to a 
stable value. In this example we have


After trial number:    1    2    3
Cumulated mean is:     4    3    3.33


so that more trials are necessary. 

In this problem, unlike most Monte Carlo problems, an exact cal- 
culation of the answer is much easier than the Monte Carlo calculation. 
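The probability of a hit with a single bomb is just

$$\begin{aligned}
&\left\{\frac{1}{\sqrt{2\pi}}\int_{-1.250}^{1.250} e^{-u^2/2}\,du\right\} \times \left\{\frac{1}{\sqrt{2\pi}}\int_{-0.625}^{0.625} e^{-v^2/2}\,dv\right\} \\
&\qquad = 0.789 \times 0.468 \quad \text{(from tables of the normal integral)} \\
&\qquad = 0.369
\end{aligned}$$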




TABLE 4

Sampling results, bomb by bomb, for the first three trials. [Table entries are illegible in this copy.]


The mean number of hits in a mission dropping 10 bombs is just 10
times this, or 3.69.
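Both calculations of Example 12 can be sketched as follows. The code is an illustration under stated assumptions: pseudo-random normal deviates replace the printed table, the normal integral is evaluated with the error function rather than read from tables, and the function names and the number of simulated missions are our own.

```python
import math
import random

def normal_cdf(z):
    """Cumulative distribution function of the standard normal distribution."""
    return 0.5 * (1.0 + math.erf(z / math.sqrt(2.0)))

def exact_hit_probability(half_length=250.0, half_width=125.0, sigma=200.0):
    """Probability that a single bomb lands inside the 500 by 250 ft target."""
    p_long = normal_cdf(half_length / sigma) - normal_cdf(-half_length / sigma)
    p_short = normal_cdf(half_width / sigma) - normal_cdf(-half_width / sigma)
    return p_long * p_short

def simulate_missions(n_missions, bombs=10, seed=0):
    """Monte Carlo estimate of the mean number of hits per mission."""
    rng = random.Random(seed)
    hits = 0
    for _ in range(n_missions * bombs):
        u = rng.gauss(0.0, 1.0)   # deviation along the flight line, in units of 200 ft
        v = rng.gauss(0.0, 1.0)   # deviation across the flight line, in units of 200 ft
        if abs(u) < 1.25 and abs(v) < 0.625:
            hits += 1
    return hits / n_missions

exact_mean_hits = 10 * exact_hit_probability()
estimated_mean_hits = simulate_missions(n_missions=5000)
```

Several thousand simulated missions are needed before the estimate agrees with the exact value to within a few hundredths, which bears out the remark that three trials give no real confidence in the result.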


Example 13 


A bakery delivers fresh pumpernickel bread to one of its retail
stores each day. The number of loaves delivered each day is not constant
but has the following distribution:


Loaves per day Probability 
10 0.05 
11 0.10
12 0.20 
13 0.30 
14 0.20 
15 0.10 
16 0.05 


The number of customers desiring pumpernickel bread each day has 
the distribution: 


Number of customers Probability

[Table entries are illegible in this copy.]


Finally, the probability that a customer in need of pumpernickel 
bread wants 1, 2, or 3 loaves is described by: 


Loaves to a customer Probability 
1 0.40 
2 0.40 
3 0.20 


Estimate by Monte Carlo methods the average number of loaves of 
pumpernickel left over per day, and the average number of sales 
(loaves) lost per day owing to lack of bread. Assume that left-over 
bread is given away at the end of each day. 


SOLUTION

The Monte Carlo sampling routine of the text applies equally well 
to discrete distributions. The first step is to plot (or visualize) the 
cumulative distributions for the random variables in question. These 
are displayed in Figures 3.5, 3.6, and 3.7. 

Figure 3.5. Cumulative distribution for the daily number of delivered loaves. [Vertical axis: Pr{No. delivered ≤ X}, from 0 to 1.00.]

We shall follow through the purchase and sale of bread for 10 days.
A larger number of days would, of course, give a more reliable answer.
First we compute the number of delivered loaves on each of the 10
days. This is done by taking random numbers from 000 to 999 and
applying the Monte Carlo sampling routine, using Figure 3.5. The
results are given below; the numbers in parentheses are the random
numbers used to determine the sample value of the variable:


13 (608) 10 (022) 
15 (861) 10 (048) 
12 (215) 10 (029) 
13 (380) 12 (333) 
14 (775) 14 (844) 
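The reading of Figure 3.5 can be imitated in code. The sketch below rests on one assumption of ours: a three-digit random number r selects the first value whose cumulative probability, expressed in thousandths, exceeds r. This convention reproduces the ten values listed above, but the text does not say how boundary cases are to be read from the figure.

```python
from itertools import accumulate

LOAVES = [10, 11, 12, 13, 14, 15, 16]
PROB_THOUSANDTHS = [50, 100, 200, 300, 200, 100, 50]   # probabilities times 1000
CUMULATIVE = list(accumulate(PROB_THOUSANDTHS))        # 50, 150, 350, 650, 850, 950, 1000

def sample_value(random_number, values, cumulative):
    """Return the first value whose cumulative count exceeds the random number."""
    for value, upper in zip(values, cumulative):
        if random_number < upper:
            return value
    raise ValueError("random number must lie between 000 and 999")

# The ten random numbers quoted in the text, in the order day 1 to day 10.
random_numbers = [608, 861, 215, 380, 775, 22, 48, 29, 333, 844]
deliveries = [sample_value(r, LOAVES, CUMULATIVE) for r in random_numbers]
```

Working in thousandths keeps the comparison exact and avoids any rounding question at the steps of the cumulative distribution.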
Figure 3.6. Cumulative distribution for the daily number of customers.

Next we compute the daily number of customers, using Figure 3.6:

10 (981)     8 (574)
 6 (240)     8 (502)
 6 (207)     7 (396)
10 (987)     8 (815)
 7 (320)     7 (430)

For each of these 77 customers, we compute the number of loaves
wanted by means of Figure 3.7. The results of the procedure are
given as the columnar entries in Table 5. The sum for a column gives
the demand for that day; and the difference between demand and
supply gives the daily surplus or shortage.


TABLE 5

Day            |  1   2   3   4   5   6   7   8   9  10
Daily demand   | 17  10   9  14  10  18  14  16  13  13
Daily supply   | 13  15  12  13  14  10  10  10  12  14
Lost sales     |  4   -   -   1   -   8   4   6   1   -
Leftovers      |  -   5   3   -   4   -   -   -   -   1

[The rows giving the loaves wanted by each individual customer are illegible in this copy; only the daily totals are shown.]


In the 10-day period there were lost sales amounting to 24 loaves, 
and leftovers amounting to 13 loaves. So the average number of lost 
sales per day is 2.4 loaves; and the average daily number of left-over 


loaves is 1.3.

Figure 3.7. Cumulative distribution for the number of loaves wanted by a
customer.
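The bookkeeping at the foot of Table 5 can be checked with a short calculation. One assumption should be noted: the demand on day 10 is taken here as 13 loaves, the value required by the totals of 24 lost sales and 13 leftovers quoted in the text. The variable names are our own.

```python
# Daily demand and supply for the 10 simulated days (day 10 demand assumed to be 13).
demand = [17, 10, 9, 14, 10, 18, 14, 16, 13, 13]
supply = [13, 15, 12, 13, 14, 10, 10, 10, 12, 14]

# A shortage counts as lost sales; a surplus is given away as leftovers.
lost_sales = [max(d - s, 0) for d, s in zip(demand, supply)]
leftovers = [max(s - d, 0) for d, s in zip(demand, supply)]

average_lost_sales = sum(lost_sales) / len(demand)   # loaves per day
average_leftovers = sum(leftovers) / len(demand)     # loaves per day
```

Because leftover bread is given away each day, no inventory is carried forward and each day can be treated independently, as is done here.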


PROBLEMS FOR SOLUTION 
Set IV 


1. Estimate the mean range in samples of 10 from a rectangular distribution.
Compare your result with the exact value of this parameter. (The distribution 
of the range in samples from a rectangular distribution can be found in Paul 
G. Hoel, Introduction to Mathematical Statistics, 2nd ed., p. 241, John Wiley 


& Sons, New York, 1954.) 

2. Suppose we are studying the inventory situation in a plant, and are
interested in generating possible sales for 30 days. Assuming that the number 
of sales per day is Poisson with mean 5, generate 30 days of sales by Monte 
Carlo methods. [Hint: Use tables of the cumulative Poisson distribution.] 


3. An automobile production line turns out about 100 cars a day, but de-
viations occur owing to many causes. The production is more accurately 


described by a probability distribution, viz.: 


Production per day Probability 
95 0.03 
96 0.05 
97 0.07 
98 0.10 
99 0.15 

100 0.20 
101 0.15 
102 0.10 
103 0.07 
104 0.05 
105 0.03 




Finished cars are transported across the bay at the end of each day, by ferry. 
If the ferry has space for only 101 cars, what will be the average number of 
cars waiting to be shipped, and what will be the average number of empty 
spaces on the boat? 


4. An airline has 15 flights leaving a given base per day, each with one
stewardess. The airline has a policy of keeping three reserve stewardesses on 
call to replace stewardesses scheduled for flights who become sick. The 
probability distribution for the daily number of sick stewardesses is as follows: 


Number sick Probability

[Table entries are illegible in this copy.]


Use Monte Carlo methods to estimate the utilization of reserve stewardesses, 
and also the probability that at least one flight will be canceled because no 
stewardess is available. Compare with the exact answers. 


5. A cab agency finds that the economic life distribution of its cabs is 
described by the following table: 


Months until replacement Probability 


6 0.05 
8 0.10 
10 0.15 
12 0.20 
14 0.20 
16 0.10 
18 0.10 
20 0.05 
24 0.05 

1.00 


If the company maintains a fleet of 20 cars, estimate the number of replace- 
ments that will be made in a period of two years. 


6. Three points are chosen at random on the circumference of a circle. 
Estimate the probability that they all lie on the same semicircle, by Monte 
Carlo methods. [Ans. 0.75] 


7. A manufacturing company located in Cleveland has three warehouses: 
one in Los Angeles, one in St. Louis, and one in New York City. The present 
stock levels in the warehouses are 400, 500, and 600, respectively. The daily
demand at each warehouse follows a Poisson distribution, with means 12, 15,
and 20, respectively. A strike at the factory has just occurred, and the factory
is to be shut down for an indefinite period. If the factory has 400 items in
stock, and will ship 15 times the average daily demand to any warehouse whose
stock level drops below 15 times the average daily demand, how long will it
take for at least one warehouse to be out of stock?


CHAPTER 4


Inventory


Inventory is the physical stock of goods that a business keeps on 
hand in order to promote the smooth and efficient running of its affairs.
It may be held before the production cycle, in the form of raw-material 
inventory; at an intermediate stage in the production cycle, as in- 
process inventory; or at the end of the production cycle, as finished- 
goods inventory. For definiteness, we shall confine our discussion for 
the most part to finished-goods inventory, but the procedures developed 
can be adapted with minor changes of wording to other kinds of inven- 
tory. 

A certain amount of inventory is usually necessary, but a business 
can function with some degree of efficiency over a fairly wide range of 
inventory levels. Control over inventory can be exercised by changing 
the timing of production runs, by changing the size of the runs, and by 
changes in promotional effort or sales inducements. Here we consider 
that the marketing activity is either constant or beyond the province 
of the inventory planner, so that an inventory policy will involve only 
the timing and amounts of the production runs. 

Possible advantages associated with increased inventory are the 
economies of production with large run sizes, faster shipment of orders 
to customers, stabilized work loads, and profits from speculation in a 
market where prices are expected to rise. The disadvantage associated 
with increased inventory is simply that the holding of inventory costs
money (e.g., warehouse rent, depreciation and deterioration, interest
on invested capital, physical handling and accounting). Clearly it is
desirable to increase inventory only when the resulting savings (or 
profits) more than outweigh the cost of the increase. 

In the examples presented here we will consider only the first two of 
the advantages mentioned above resulting from increased inventory: 
namely, production economies with large run sizes, and the reduction
of delay in filling customer orders. Although stability of work load and 
speculation sometimes loom large in the making of inventory policy, no 
simple theory is available at present for such problems, and a discussion 
of the attempts at solution would be outside the scope of this book. 


KNOWN DEMAND 


In this section we consider first a very simple situation in which a 
contractor has a firm order to supply goods at a uniform rate R per 
unit time. He starts a production run every $t$ time units, where $t$ is
fixed; and his setup cost each time he starts a run is $C_3$. Production time
is negligible; so there will be no delay in fulfilling demand as long as a 
new run is started whenever inventory is zero. We assume that the 
contract is such that its terms must be satisfied, and that zero inventory 
will always be the signal for further production. The cost of holding 
inventory is assumed to be proportional to both the amount of inven- 
tory and the time the inventory is held. Then the cost of holding 
inventory $I$ for time $T$ may be denoted by $C_1IT$, where $C_1$ is the cost
of holding one unit in inventory for one time unit. The question to be 
answered is: How frequently should a production run be made, and 
what quantity of goods should be produced? We are concerned at this 
point with balancing the economies of increased run size against the 
cost of holding inventory. 

Figure 4.1 shows the elementary inventory situation we are considering.
If a run is to be made at intervals $t$, an amount $Rt$ must be
produced at each run. The shaded area represents

$$\int_0^t R\,t\,dt$$

and has the value $\tfrac{1}{2}Rt^2$. Thus the cost of holding inventory is $\tfrac{1}{2}C_1Rt^2$
per production run.

The cost of setup is $C_3$ per production run. Hence the average total
cost per unit time is

$$C = \tfrac{1}{2}C_1Rt + \frac{C_3}{t}$$

For a minimum, we differentiate $C$ with respect to $t$ and set the derivative
equal to zero, obtaining

$$t = \sqrt{\frac{2C_3}{C_1R}} \tag{4.1}$$

As the reader can verify, the quantity $q$ to be produced at each run
becomes

$$q = \sqrt{\frac{2C_3R}{C_1}} \tag{4.2}$$

and the resulting minimum cost has the value $\sqrt{2C_1C_3R}$ per time unit.

Figure 4.1. Demand constant from month to month, shipment uniform during
a month, no shortages permitted.
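Formulas (4.1) and (4.2) can be tried out numerically. The sketch below is illustrative only: the function names are our own, and the cost figures ($C_1 = 0.25$, $C_3 = 50$, $R = 100$) are hypothetical values chosen to give round answers.

```python
import math

def average_cost(t, holding_cost, setup_cost, demand_rate):
    """Average total cost per unit time, C = (1/2) C1 R t + C3 / t."""
    return 0.5 * holding_cost * demand_rate * t + setup_cost / t

def economic_lot_size(holding_cost, setup_cost, demand_rate):
    """Return the optimal interval t (4.1), lot size q (4.2), and minimum cost."""
    t = math.sqrt(2.0 * setup_cost / (holding_cost * demand_rate))
    q = demand_rate * t   # equals sqrt(2 C3 R / C1)
    cost = math.sqrt(2.0 * holding_cost * setup_cost * demand_rate)
    return t, q, cost

# Hypothetical data: C1 = 0.25 per unit per time unit, C3 = 50 per run, R = 100 per time unit.
t_opt, q_opt, c_min = economic_lot_size(holding_cost=0.25, setup_cost=50.0, demand_rate=100.0)
```

Evaluating `average_cost` slightly to either side of `t_opt` gives a larger value in both directions, which confirms that the stationary point is a minimum.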


Equation (4.2) has been termed the “economic lot size formula”; the 
result was first given by F. W. Harris in 1915. It will be realized that 
some of the assumptions made are not likely to be satisfied in practice. 
It seldom happens that customer demand is known exactly or that 
production time is negligible. 

We can generalize the situation somewhat and obtain a similar
formula for the optimal lot size. Let us assume that the contract
specifies merely some total demand $D$, to be satisfied during some long
time period $T$, with the proviso that output will be shipped to the
customer uniformly between production runs. We decide that each
time a production run is made, we will make some fixed quantity $q$.
Figure 4.2 shows the resulting inventory situation.

Figure 4.2. Fixed run size and fixed number of runs, with discretion as to
scheduling of runs.

There will be $D/q$ production runs made during the period $T$ in
question. If $t_1$ is the time interval between run 1 and run 2, and $t_2$ is
the time interval between run 2 and run 3, etc., then the holding costs
for the period $T$ will be

$$\tfrac{1}{2}C_1q(t_1 + t_2 + \cdots + t_n) = \tfrac{1}{2}C_1qT$$

and the setup costs will be

$$\frac{C_3D}{q}$$

Denoting the total cost with a fixed run size $q$ by $C(q)$, we have

$$C(q) = \tfrac{1}{2}C_1qT + \frac{C_3D}{q}$$

The total cost $C(q)$ attains its minimum for a value of $q$ given by

$$q = \sqrt{\frac{2C_3(D/T)}{C_1}} \tag{4.3}$$

Note that (4.3) is similar to (4.2), except that the fixed demand rate $R$
is replaced by the "average" demand rate $D/T$.

Other variations are provided in the following examples.


Example 1

A contractor has to supply 10,000 bearings per day to an automobile
manufacturer. He finds that, when he starts a production run, he can
produce 25,000 bearings per day. The cost of holding a bearing in
stock for one year is 2 cents, and the setup cost of a production run is
$18.00. How frequently should production runs be made?


SOLUTION

We assume that run sizes are constant, that a new run will be started
whenever inventory is zero, and that the sole reason for producing for
inventory is to obtain lower production costs.

We use the following notation:

$R$ = number of bearings required per day
$k$ = number of bearings produced per day
$C_1$ = cost of holding one bearing in inventory per day
$C_3$ = cost of setting up a production run
$q$ = number of bearings produced per run
$t$ = interval between runs


Figure 4.3 gives a sketch of the way inventory varies with time. In
this figure, the slope of the line NB is the rate of depletion of inventory
and has the value $-R$; the slope of ON is the difference between
the rate of production and the rate of depletion, and has the value $k - R$.
Production will continue for a time $t_1 = q/k$, and the period of the
entire inventory cycle has the value $t = q/R$. The area of triangle
ONB, multiplied by the holding cost of one bearing per day, represents
the inventory holding cost per cycle; this cost, plus the cost of one
setup, gives the total cost per cycle.

Figure 4.3. Inventory situation for Example 1.

To obtain the area of triangle ONB, we first note that its base is $t$.
The altitude NC is the inventory level at the moment a production run
is completed, or $q - Rt_1 = q(1 - R/k)$. Thus the area of the shaded
triangle is $\tfrac{1}{2}qt(1 - R/k)$, and

$$\begin{aligned}
\text{Cost per cycle} &= C_3 + \tfrac{1}{2}C_1qt(1 - R/k) \\
F = \text{cost per day} &= C_3/t + \tfrac{1}{2}C_1q(1 - R/k) \\
&= C_3R/q + \tfrac{1}{2}C_1q(1 - R/k)
\end{aligned}$$
To obtain the optimum run size, we differentiate $F$ with respect to
$q$ and set the derivative equal to zero. The result is

$$q = \sqrt{\frac{2C_3R}{C_1(1 - R/k)}}, \qquad t = \frac{q}{R} = \sqrt{\frac{2C_3}{RC_1(1 - R/k)}}$$

Here $C_1$ = 2 cents per bearing per year = $0.000055 per day
$C_3$ = $18
$R$ = 10,000 bearings per day
$k$ = 25,000 bearings per day

Then

$$q = \sqrt{\frac{2 \times 18 \times 10^4}{5.5 \times 10^{-5} \times (1 - \tfrac{2}{5})}} = \sqrt{1.091 \times 10^{10}} \approx 105{,}000 \text{ bearings}$$

and

t = 10.5 days
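The arithmetic of Example 1 can be verified with the following sketch. It is an illustration only: the function name is our own, and $C_1$ is computed as 0.02/365 dollars per day rather than rounded to $0.000055, so the run size comes out slightly below the rounded figure of 105,000.

```python
import math

def run_size_with_finite_production(setup_cost, holding_cost, demand_rate, production_rate):
    """Optimal run size q and cycle length t when production takes place at a finite rate."""
    q = math.sqrt(
        2.0 * setup_cost * demand_rate
        / (holding_cost * (1.0 - demand_rate / production_rate))
    )
    t = q / demand_rate
    return q, t

q_bearings, t_days = run_size_with_finite_production(
    setup_cost=18.0,           # dollars per run
    holding_cost=0.02 / 365,   # dollars per bearing per day
    demand_rate=10_000,        # bearings per day
    production_rate=25_000,    # bearings per day
)
```

Letting the production rate grow without limit makes the factor $(1 - R/k)$ approach unity, and the result reduces to the economic lot size formula (4.2).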

In the previous example, the sole objective of holding inventory was 
to achieve economies of increased run sizes. We assumed implicitly 
that the costs associated with failure to meet a delivery schedule were 
so high that such an eventuality would not be allowed to occur. In 
the rest of the examples of this chapter, we allow the cost of failure to 
meet a delivery schedule to be finite. It will be seen that there are 
situations where the total cost can be reduced by deliberately failing 
to meet some of the scheduled delivery dates. 


Example 2 

A subcontractor undertakes to supply Diesel engines to a truck 
manufacturer at the rate of 25 per day. There is a clause in the con- 
tract penalizing him $10 per engine per day late for missing the sched- 
uled delivery date. He finds that the cost of holding a completed 
engine in stock is $16 per month. His production process is such that 
each month (30 days) he starts a batch of engines through the shops, 
and all these engines are available for delivery any time after the end 
of the month. What should his inventory level be at the beginning of
each month (i.e., immediately after taking into stock the engines made
in the previous month, and then shipping engines to fill unsatisfied
demand from the previous month)?


SOLUTION
In this problem we are concerned with balancing the costs of holding 
inventory against the costs of delayed deliveries to customers. Since 
the subcontractor has already decided to produce a batch of engines 
every month, he has fixed the size of each batch at 750 engines, and we 
may assume that he is no longer concerned with economies in run size. 
We shall use the following notation: 


$C_1$ = cost per day of holding an engine in inventory
$C_2$ = penalty cost per day of failing to deliver one engine on schedule
$R$ = contracted number of engines per day
$z$ = planned inventory level at beginning of a month


Our problem is to find the value of z that minimizes costs. 


Figure 4.4. Inventory situation for Example 2.


The inventory situation during each month is represented approxi- 
mately by Figure 4.4. The cross-hatched area represents inventory, 
and the dotted area represents failure to meet delivery. (Although 
delivery actually takes place in batches of R engines per day, we are 
approximating to the actual situation by assuming continuous delivery 
at the rate of R per day.) With this approximation, the number of 
“engine-days” of inventory is given by the area of triangle OAB, and 
the cost of holding inventory is $C_1$ times this area. By similar triangles
$OB/30 = z/30R$, and $OB = z/R$. Therefore the area of triangle OAB
is $z^2/2R$, and the cost of holding inventory is $C_1z^2/2R$.

The number of "engine-days" by which the delivery schedule is
missed is given by the area of triangle BDC, easily computed to be

$$\frac{(30R - z)^2}{2R}$$

Hence the cost of shortages is $(C_2/2R)(30R - z)^2$.

The total cost $F(z)$ is found by adding these two costs. Therefore

$$F(z) = \frac{C_1z^2}{2R} + \frac{C_2(30R - z)^2}{2R}$$

For the minimum cost $dF/dz = 0$, and we obtain

$$z = 30R\,\frac{C_2}{C_1 + C_2}$$

It will be observed that, since $C_2/(C_1 + C_2) < 1$, the optimum
inventory level is less than the number of engines to be delivered each
month, and we would deliberately plan for a shortage. Here $R = 25$,
$C_1 = 16/30$, $C_2 = 10$, so that the best starting inventory is

$$z = 30 \times 25 \times \frac{10}{16/30 + 10} = 712 \text{ engines}$$

It is not difficult to verify that such a policy reduces inventory costs
in the ratio $C_2 : (C_1 + C_2)$ as compared with starting each month with
an inventory $30R$ and never failing to meet the delivery schedule.
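The result of Example 2, and the cost ratio just mentioned, can be checked numerically. The sketch is illustrative; the function names are our own and the continuous-delivery approximation of Figure 4.4 is assumed throughout.

```python
def monthly_cost(z, holding_cost, penalty_cost, rate, days=30):
    """Holding cost plus shortage cost for one month with starting inventory z."""
    holding = holding_cost * z ** 2 / (2 * rate)
    shortage = penalty_cost * (days * rate - z) ** 2 / (2 * rate)
    return holding + shortage

def best_starting_inventory(holding_cost, penalty_cost, rate, days=30):
    """Starting inventory that minimizes the monthly cost, z = 30 R C2 / (C1 + C2)."""
    return days * rate * penalty_cost / (holding_cost + penalty_cost)

C1 = 16 / 30   # dollars per engine per day
C2 = 10.0      # dollars per engine per day late
R = 25         # engines per day

z_best = best_starting_inventory(C1, C2, R)

# Compare with the policy of never missing a delivery (z = 30R = 750).
cost_ratio = monthly_cost(z_best, C1, C2, R) / monthly_cost(30 * R, C1, C2, R)
```

With these figures the ratio is about 0.95, so the planned shortage saves only some 5 per cent of the inventory cost; the penalty of $10 per day is large compared with the holding cost.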

Before reaching a decision to operate with a policy of deliberately 
missing scheduled delivery dates, careful consideration of the suitability 
of the model would be required. We have assumed that the cost of
missing delivery dates is the $10 penalty per engine per day late, as 
specified by the contract. Further analysis might show that this is a 
considerable underestimate, and that a policy of this type would result 
in a loss of customer goodwill, with reduced chances of the contract 
being renewed. If this were so, the true value of C2 might be much 
higher than $10 per engine per day late. 


If there are no setup costs associated with starting the production 
of a batch of engines, the decision to start a batch every month may be 
correct. However, if setup costs are not zero, there is an interval 
between setups which will minimize costs for a given starting inventory 
level. In the next example we will determine the optimal combination 
of starting inventory level and interval between setups. 


Example 3 (modification of previous example)

A contractor undertakes to supply Diesel engines to a truck manufacturer
at a rate of 25 per day. He finds that the cost of holding a
completed engine in stock is $16 per month, and there is a clause in the
contract penalizing him $10 per engine per day late for missing the
scheduled delivery date. Production of engines is in batches, and each
time a new batch is started there are setup costs of $10,000. How
frequently should batches be started, and what should be the initial
inventory level (in the sense of the previous example) at the time each
batch is completed?


SOLUTION

In these circumstances economies in production costs per engine can
be achieved by increasing the batch size. Such economies will be offset
by the resulting increase in inventory cost; so it is necessary to consider
simultaneously the inventory level and the batch size. We will use $C_3$
to denote the setup cost and $t$ for the interval between starting batches.
Arguing as in Example 2, if $z$ is the planned inventory level on completion
of a batch of engines, we can show that the costs of holding
inventory together with the penalty costs of missed deliveries will total

$$\frac{C_1z^2}{2R} + \frac{C_2(tR - z)^2}{2R}$$

over the interval $t$ between batches.

In addition, there is now a setup cost $C_3$, so that the average cost
over time $F(t, z)$ is given by

$$F(t, z) = \frac{1}{t}\left[\frac{C_1z^2}{2R} + \frac{C_2(tR - z)^2}{2R}\right] + \frac{C_3}{t}$$

(It is necessary to consider average cost, in order to make comparisons
between different values of $t$.) For minimum cost, $\partial F/\partial z = 0$, and
$\partial F/\partial t = 0$.

$$\frac{\partial F}{\partial z} = \frac{1}{t}\left[\frac{C_1z}{R} - \frac{C_2(tR - z)}{R}\right] = 0, \qquad \text{so that} \qquad z = \frac{tRC_2}{C_1 + C_2}$$

$$\frac{\partial F}{\partial t} = -\frac{1}{t^2}\left[\frac{C_1z^2}{2R} + \frac{C_2(tR - z)^2}{2R} + C_3\right] + \frac{C_2(tR - z)}{t} = 0$$

Substituting for $z$ in the second equation yields, with some rearrangement,

$$t = \sqrt{\frac{2C_3(C_1 + C_2)}{C_1C_2R}} \tag{4.4}$$

and hence

$$z = \sqrt{\frac{2C_2C_3R}{C_1(C_1 + C_2)}} \tag{4.5}$$

Before substituting numerical values in such results, it is advisable
to perform a check on the dimensions. The fundamental quantities in
the problem are time, which we will denote by [T], numbers of engines,
[N], and money, [$]. Since $C_1$ and $C_2$ are costs per engine per day,
their units are [$]/[N][T]. $C_3$ is a fixed cost, and the units are simply
[$]. $R$ is a number of engines per day; so the units are [N]/[T]. $z$ and
$t$ are, respectively, numbers of engines, and time, with units [N] and
[T].

The units of $2C_3(C_1 + C_2)/(C_1C_2R)$ are

$$\frac{[\$]\,\bigl([\$]/[N][T]\bigr)}{\bigl([\$]/[N][T]\bigr)^2\,\bigl([N]/[T]\bigr)} = \frac{[\$]^2[N]^2[T]^3}{[\$]^2[N]^2[T]} = [T]^2$$

which are the units of $t^2$.

The units of $2C_2C_3R/\bigl(C_1(C_1 + C_2)\bigr)$ are

$$\frac{\bigl([\$]/[N][T]\bigr)\,[\$]\,\bigl([N]/[T]\bigr)}{\bigl([\$]/[N][T]\bigr)^2} = \frac{[\$]^2[N]^2[T]^2}{[\$]^2[T]^2} = [N]^2$$

which are the units of $z^2$.

Having satisfied ourselves that the dimensions of equations (4.4) and
(4.5) for $t$ and $z$ are correct, we can substitute the numerical values:

$C_1$ = $16/30 per day per engine
$C_2$ = $10 per day per engine
$C_3$ = $10,000
$R$ = 25 engines per day

We obtain $t = 40$ days, and $z = 943$ engines.

Thus it would be better to start a new batch approximately every
6 weeks, rather than every month.

The average cost per day is

$$F(t, z) = \frac{1}{t}\left[\frac{C_1z^2}{2R} + \frac{C_2(Rt - z)^2}{2R} + C_3\right] = \$503$$

with $t = 40$ and $z = 943$. If we start a batch every month, then, from
Example 2, $z$ is 712, and the average cost per day is $F(30, 712)$ = $523.
There is thus a saving of about $20 a day to be obtained by optimizing
$z$ and $t$ simultaneously.
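The figures of Example 3 can be reproduced with a short calculation based on equations (4.4) and (4.5). The sketch is an illustration; the function names are our own.

```python
import math

def optimal_interval_and_level(C1, C2, C3, R):
    """Interval between batches (4.4) and planned inventory level (4.5)."""
    t = math.sqrt(2.0 * C3 * (C1 + C2) / (C1 * C2 * R))
    z = math.sqrt(2.0 * C2 * C3 * R / (C1 * (C1 + C2)))
    return t, z

def average_daily_cost(t, z, C1, C2, C3, R):
    """Average cost per day: holding, shortage, and setup costs spread over t days."""
    return (C1 * z ** 2 / (2 * R) + C2 * (t * R - z) ** 2 / (2 * R) + C3) / t

C1 = 16 / 30    # dollars per engine per day
C2 = 10.0       # dollars per engine per day late
C3 = 10_000.0   # dollars per setup
R = 25          # engines per day

t_opt, z_opt = optimal_interval_and_level(C1, C2, C3, R)
cost_optimal = average_daily_cost(40, 943, C1, C2, C3, R)   # rounded optimum of the text
cost_monthly = average_daily_cost(30, 712, C1, C2, C3, R)   # monthly policy of Example 2
```

The difference between the two costs is the saving of about $20 a day referred to above.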


PROBLEMS FOR SOLUTION 


Set V 


1. An aircraft company uses rivets at an approximately constant rate of 
5000 pounds per year. The rivets cost $2.00 per pound, and the company 
personnel estimate that it costs $20.00 to place an order, and that the carrying 
cost of inventory is 10% per year. 


(a) How frequently should orders for rivets be placed, and what quantities 
should be ordered? 

(b) If the actual costs are $50.00 to place an order and 15% for carrying 
cost, the optimal policy would change. How much is the company losing per 
year because of imperfect cost information? 


2. A manufacturer receives an order for 6890 items to be delivered over a 
period of a year as follows: 
At end of week 1: 5 items. 
At end of week 2: 10 items. 
At end of week 3: 15 items. 
etc.
The cost of carrying inventory is $2.60 per item per year, and the cost of a
setup is $450 per production run.


(a) Compute costs for the following policies:


(1) Make all 6890 at start of year. [Ans. About $12,000]
(2) Make 3445 now and 3445 in 6 months. [Ans. About $8000]
(3) Make 1/12 of the order each month. [Ans. About $9000]
(4) Make 1/52 of the order each week. [Ans. About $27,000]


(b) Attempt to devise a more rational policy.


3. A manufacturing concern has a fixed cyclic demand, with a period of 
one week, as follows: 


Monday 9 
Tuesday 17 
Wednesday 2 
Thursday 0 
Friday 19 
Saturday 9 
Sunday 14 


Company policy is to maintain constant daily production. The production 
and shipping departments work seven days a week, and each day’s production 
is available for shipment on the following day. 




If a shortage costs four times as much per day as a surplus of the same 
amount, how much stock should be on hand at the start of business on Monday? 
[Ans. 10] 


4. The weekly production of a given item is normally distributed with mean
10,000 and standard deviation 200. The demand is normally distributed with 
mean 10,500 and standard deviation 1000. Overtime on Saturday must be 
scheduled in advance and will produce an average of 500 items normally dis- 
tributed with standard deviation 20. Each item made in overtime costs an 
extra $8.00 in production costs. Each item short at the end of the week costs 
$10.00 (lost sale). Overproduction costs $5.00 per item and cannot be used to 
fulfill demand in succeeding weeks (e.g., it is immediately sold at a loss). Use 
Monte Carlo methods to determine whether overtime should be scheduled or 
not. 


PROBABILISTIC DEMAND 


As we have already pointed out, practical situations in which we 
know future demand precisely are somewhat unusual. For this reason 
the examples already given may appear to be artificial. In this section 
we assume less information about future demand: we assume that the 
probability distribution of future demand is known, rather than the 
exact value of the demand itself. The methods used are analogous to 
those already discussed; and familiarity with the examples already 
given will aid the reader in following the discussion. 

The probability distribution of future demand can often be deter- 
mined from past experience. Two classes of distribution are distin- 
guishable in theory, although in practice they tend to merge, and 
either may be used as an approximation to the other when it is con- 
venient. When we are considering goods, such as automobiles, which 
can only be produced in discrete quantities, we usually think of orders as 
having a discrete probability distribution. On the other hand, demand 
for water from a utility company can vary in a continuous manner, and 
a continuous probability density function might be more appropriate 
for describing such demand.

However, the manufacturer making thousands of automobiles per 
year might well decide that one car was such a small unit of his produc- 
tion that for practical purposes the demand could be described by a 
continuous distribution. Conversely, it might be convenient for the 
utility company to measure demand in thousands of gallons and to use 
a discrete probability function.

In situations where future demand is known only in a probabilistic 
sense, it is not possible to choose policies that will insure the minimiza- 
tion of actual costs. Instead we choose policies that will minimize the 
expected costs. Expected costs are found by taking actual costs corresponding
to each particular situation that could arise, multiplying
by the probability of occurrence of that situation, and then either
summing or integrating as is appropriate.


Example 4 

An automobile dealer has the option of ordering once every week 
or once every two weeks. He finds that the cost of holding a car in 
stock for a week is $6 (insurance, minor deterioration, interest on 
borrowed capital, etc.). Customers who cannot obtain new cars im- 
mediately tend to go to other dealers; and he estimates that for every 
customer who cannot get immediate delivery he loses an average of 
$100. For one particular model of car the probabilities of a demand 
of 0, 1, 2, 3, 4, and 5 cars in a week are 0.05, 0.10, 0.20, 0.30, 0.20, and
0.15, respectively. Every time the dealer places an order, he has fixed 
costs of $40. How often should he order, and to what level should he 
bring his stocks? (Assume that there is no time lag between ordering 
and delivery.) 


SOLUTION

We first assume that the dealer orders every week, and that each time 
he places an order he brings his stock level up to z. We then find z so 
as to minimize his total expected costs during the week. Similar cal- 
culations based on an assumption that he places an order every two 
weeks will enable us to compare the total expected costs per week for 
the two ordering policies. We can thus decide which policy will result
in lower expected costs. 

We will require the following notation:

$t$ = interval between orders
$n$ = demand for cars in the time interval $t$
$p_1(n)$ = probability density of $n$ when $t$ is 1 week ($n = 0, 1, 2, 3, 4, 5$)
$p_2(n)$ = probability density of $n$ when $t$ is 2 weeks ($n = 0, 1, 2, \ldots, 9, 10$)
$C_1$ = holding cost per week for each car in stock
$C_2$ = cost of failing to deliver a car to a customer on demand
$C_3$ = cost of placing an order for cars
$z$ = stock level immediately on receipt of a shipment of cars
$F_1(z)$ = expected cost (excluding the ordering cost $C_3$) associated with a policy of ordering every week to bring the stock level to $z$
$F_2(z)$ = expected cost (excluding the ordering cost $C_3$) associated with a policy of ordering every 2 weeks to bring the stock level to $z$

The inventory situation in a typical interval $t$, between orders, is
shown in Figure 4.5. We will assume that, if there is a demand for $n$
cars in the period $t$, the demand arises at times

$$\frac{t}{n+1},\ \frac{2t}{n+1},\ \frac{3t}{n+1},\ \ldots,\ \frac{nt}{n+1}$$

Since we are told that deliveries to the dealer take place immediately 
on request, each period ¢ starts with an inventory z. Figure 4.5 illus- 


n<z 


Figure 4.5. Inventory situation for Example 4. 


trates the two situations: n greater than z; and n less than z. In both 
cases the number of automobile-weeks is represented by the cross- 
hatched area. 

For n > z, the cross-hatched area is $tz(z+1)/2(n+1)$, and thus the
cost of holding inventory is $C_1 t z(z+1)/2(n+1)$. The dealer will fail
to supply n - z customers, and this will cost him $C_2(n-z)$, so that the
total cost is

$$\frac{C_1 t z(z+1)}{2(n+1)} + C_2(n-z)$$

For n ≤ z, the cost of holding inventory is $C_1 t(z - n/2)$, and all
customer demand is satisfied.

The total expected cost $F_1(z)$ for the period t, excluding the ordering
cost $C_3$, is obtained by multiplying the cost associated with demand n
by the probability of n, and summing over the appropriate range of n.*

$$F_1(z) = \sum_{n=0}^{z} C_1 t\left(z - \frac{n}{2}\right) p_1(n) + \sum_{n=z+1}^{\infty}\left[\frac{C_1 t z(z+1)}{2(n+1)} + C_2(n-z)\right] p_1(n)$$

*The symbol ∞ will be used in summations and integrals to indicate that the
range is to extend until all further probabilities are zero.

For the minimum value of $F_1(z)$, we require that z satisfy the
relationship

$$\Delta F_1(z-1) < 0 < \Delta F_1(z) \tag{4.6}$$

where $\Delta F_1(z)$ is the first difference of the function $F_1(z)$, as defined in
Appendix I. To obtain $\Delta F_1(z)$ we refer to Appendix I. From the
criterion given there we see we can difference under the summation sign,
provided it is true that

$$C_1 t\left(z + 1 - \frac{n}{2}\right) p_1(n) = \left[\frac{C_1 t (z+1)(z+2)}{2(n+1)} + C_2(n - z - 1)\right] p_1(n)$$

when n has the value z + 1. Since both sides have the value
$(C_1 t/2)(z+1)\,p_1(z+1)$, the requirement is satisfied, and we may
difference under the summation sign.

$$\Delta F_1(z) = \sum_{n=0}^{z} C_1 t\, p_1(n) + \sum_{n=z+1}^{\infty}\left[\frac{C_1 t (z+1)}{n+1} - C_2\right] p_1(n)$$

or

$$\Delta F_1(z) = (C_1 t + C_2)\sum_{n=0}^{z} p_1(n) + C_1 t (z+1)\sum_{n=z+1}^{\infty}\frac{p_1(n)}{n+1} - C_2 \tag{4.7}$$
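
The criterion (4.6) together with the first difference (4.7) can be sketched in a few lines of Python. This is only an illustration under the data of Example 4; the function names `delta_f` and `optimal_level` are hypothetical, and the demand list is indexed by the number of cars demanded.

```python
def delta_f(z, p, t, c1, c2):
    """First difference of the expected cost, equation (4.7)."""
    head = sum(p[: z + 1])                                    # sum of p(n), n = 0..z
    tail = sum(p[n] / (n + 1) for n in range(z + 1, len(p)))  # sum of p(n)/(n+1), n > z
    return (c1 * t + c2) * head + c1 * t * (z + 1) * tail - c2


def optimal_level(p, t, c1, c2):
    """Smallest z whose first difference is positive, criterion (4.6)."""
    z = 0
    while delta_f(z, p, t, c1, c2) <= 0:
        z += 1
    return z


p1 = [0.05, 0.10, 0.20, 0.30, 0.20, 0.15]  # weekly demand for 0..5 cars
for z in range(len(p1)):
    print(z, round(delta_f(z, p1, t=1, c1=6, c2=100), 2))
print("order up to", optimal_level(p1, t=1, c1=6, c2=100), "cars")
```

The loop always stops, because once z reaches the largest possible demand the first difference equals $C_1 t$, which is positive. Evaluated this way, the first differences agree with the tabulated values to within rounding, except that the entry for z = 2 comes out at about -60.38.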


The calculation of the optimal value of z from equation (4.7) is
shown in Table 1. We are given the values of $p_1(n)$ (the distribution
of demand corresponding to t = 1), and as a preliminary we have to
compute $p_2(n)$ (the distribution corresponding to t = 2). If it is
reasonable to assume that sales in consecutive weeks are mutually
independent, then by the methods of Chapter 2, we find that

$$p_2(n) = \sum_{m=0}^{n} p_1(m)\, p_1(n-m)$$

The calculations appear in Table 1.
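
The two-week distribution is a discrete convolution of the weekly distribution with itself. A minimal Python sketch, assuming independent weeks as stated above (the helper name `convolve` is an illustrative choice):

```python
def convolve(p, q):
    """Distribution of the sum of two independent discrete demands."""
    out = [0.0] * (len(p) + len(q) - 1)
    for m, pm in enumerate(p):
        for k, qk in enumerate(q):
            out[m + k] += pm * qk  # demand m in one week and k in the other
    return out


p1 = [0.05, 0.10, 0.20, 0.30, 0.20, 0.15]  # weekly demand for 0..5 cars
p2 = convolve(p1, p1)                      # fortnightly demand for 0..10 cars
print([round(v, 4) for v in p2])
```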

Thus, if the policy is to order every week, the stock level should be 
brought up to five cars; if orders are placed every two weeks, the stock 
level should be eight cars. 

TABLE 1

A. ORDER EVERY WEEK [t = 1, C1 = 6, C2 = 100]

                              Sum of p1(n),   Sum of p1(n)/(n+1),
n, z    p1(n)   p1(n)/(n+1)   n = 0 to z      n = z+1 upward        ΔF1(z)
 0      0.05    0.05          0.05            0.2567                -93.15
 1      0.10    0.05          0.15            0.2067                -81.62
 2      0.20    0.0667        0.35            0.140                 -60.48
 3      0.30    0.075         0.65            0.065                 -29.54
 4      0.20    0.04          0.85            0.025                  -9.15
 5      0.15    0.025         1.00            0                      +6

B. ORDER EVERY 2 WEEKS [t = 2, C1 = 6, C2 = 100]

                              Sum of p2(n),   Sum of p2(n)/(n+1),
n, z    p2(n)   p2(n)/(n+1)   n = 0 to z      n = z+1 upward        ΔF2(z)
 0      0.0025  0.0025        0.0025          0.1592                -97.81
 1      0.0100  0.0050        0.0125          0.1542                -94.90
 2      0.0300  0.0100        0.0425          0.1442                -90.05
 3      0.0700  0.0175        0.1125          0.1267                -81.32
 4      0.1200  0.0240        0.2325          0.1027                -67.80
 5      0.1750  0.0292        0.4075          0.0735                -49.07
 6      0.2000  0.0286        0.6075          0.0449                -28.19
 7      0.1800  0.0225        0.7875          0.0224                 -9.65
 8      0.1300  0.0144        0.9175          0.0080                 +3.62
 9      0.0600  0.0060        0.9775          0.0020                 +9.72
10      0.0225  0.0020        1.0000          0                     +12.00

To answer the question posed in the statement of the problem, we
must compare the total weekly costs associated with the two optimal
ordering policies. This could be done from the formula for F(z), but it
is probably quicker to compute $F_1(0)$ and then use the first differences
already tabulated.
$$F_1(0) = \sum_{n=1}^{\infty} C_2\, n\, p_1(n) = 295.00$$

$$F_1(5) = F_1(0) + \sum_{n=0}^{4} \Delta F_1(n) = 295 - 273.94 = 21.06$$

$$F_2(0) = \sum_{n=1}^{\infty} C_2\, n\, p_2(n) = 590.00$$

$$F_2(8) = F_2(0) + \sum_{n=0}^{7} \Delta F_2(n) = 590.00 - 518.79 = 71.21$$




With a policy of ordering every week so as to bring the stock level 
up to five cars, the weekly cost (exclusive of ordering) is $21.06; thus 
the total cost per week associated with ordering every week is $21.06 + 
$40.00, or $61.06. With a policy of ordering every two weeks, so as to 
bring the stock level up to eight cars, the cost per fortnight (exclusive 
of ordering) is $71.21; the total cost per fortnight is $111.21, or an 
average of $55.60 per week. There is thus a saving of $5.46 per week to be
obtained by ordering every two weeks.
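
As a cross-check, the two policies can also be compared by evaluating the expected cost directly from its definition instead of from the tabulated first differences. The sketch below is an illustration only; the function names are hypothetical and the data are those of Example 4. Evaluated this way, the weekly totals come out at roughly $61.15 and $55.62, slightly different from the rounded figures above but leading to the same choice.

```python
def convolve(p, q):
    """Distribution of the sum of two independent discrete demands."""
    out = [0.0] * (len(p) + len(q) - 1)
    for m, pm in enumerate(p):
        for k, qk in enumerate(q):
            out[m + k] += pm * qk
    return out


def expected_cost(z, p, t, c1, c2):
    """Expected holding plus shortage cost for one interval t, starting at stock z."""
    total = 0.0
    for n, pn in enumerate(p):
        if n <= z:
            cost = c1 * t * (z - n / 2)                             # all demand met
        else:
            cost = c1 * t * z * (z + 1) / (2 * (n + 1)) + c2 * (n - z)
        total += cost * pn
    return total


def best_policy(p, t, c1, c2, c3):
    """Return (optimal stock level, total cost per week including ordering)."""
    z = min(range(len(p)), key=lambda s: expected_cost(s, p, t, c1, c2))
    return z, (expected_cost(z, p, t, c1, c2) + c3) / t


p1 = [0.05, 0.10, 0.20, 0.30, 0.20, 0.15]
weekly = best_policy(p1, t=1, c1=6, c2=100, c3=40)
fortnightly = best_policy(convolve(p1, p1), t=2, c1=6, c2=100, c3=40)
print("weekly:", weekly)
print("fortnightly:", fortnightly)
```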


Example 5 

The raw material for the manufacture of bearings is metal-plated
steel strip. Plating is done in a continuous process, and the plating
machine is such that there is no setup cost in changing to a strip of the 
same width but different thickness; there is a setup cost of $9.00 in 
changing to a strip of a different width. There are just two types of 
strip which are 6 inches wide: strip type A, and strip type B. Thus 
there is no setup cost in changing from strip type A to strip type B, but 
there is a setup cost of $9.00 in changing from any other strip type to 
either A or B. Knowing this, the manufacturer always runs strip A
and strip B consecutively. His problem is how frequently to schedule 
the production of the 6-inch strip. 

Analysis of past sales shows that the weekly requirement (hundreds 
of feet per week) for strip A is distributed in a Poisson fashion with 
mean 3, while for strip B the requirement is Poisson with mean 4. The
inventory holding cost for strip A or B is 25 cents per hundred feet per 
week. If plated strip is not available for the next production process 
when required, scheduled customer delivery dates may be missed, and 
the sales department estimates that this will cost the company an 
amount directly proportional to both time and quantity. The cost is $8 
per hundred feet per week late, for either A or B. The production 
process is sufficiently rapid that the time between the setup to produce 
plated strip and the receipt of strip into inventory can be neglected.

If the period between setups is to be some fixed time interval, how
frequently should setups of 6-inch strip be made, and what quantities 
should be produced? 


SOLUTION

The approach here is to compute the minimum cost per week when
the interval between production runs is chosen in advance, to use the
points thus obtained in order to arrive at a smooth curve representing
the variation of minimum cost per week with interval between production
runs; then, by interpolation, we may find the best interval between
runs and the corresponding run sizes.
We will use the following notation: 


$\lambda_1, \lambda_2$ = mean weekly demand (in hundreds of feet) for strip A and
strip B, respectively
$C_1$ = cost per hundred feet per week of holding stocks of A (or B)
$C_2$ = cost per hundred feet per week of failure to meet demand for
A (or B)
$A_t(x)$ = probability of a demand x in time t for A
         = $e^{-\lambda_1 t}(\lambda_1 t)^x / x!$, since the demand is Poisson
$B_t(x)$ = probability of a demand x in time t for B
         = $e^{-\lambda_2 t}(\lambda_2 t)^x / x!$, since the demand is Poisson


We will first consider the costs of holding inventory and of shortage
during the period t between two production runs. These costs can be
considered separately for A and B.

Figure 4.6. Inventory situation for Example 5.

Let F(z) be the expected inventory and shortage costs during t for
either A or B alone. There are two possible inventory situations
according as the demand x is greater or less than the starting stock, z,
as pictured in Figure 4.6. We find, for both A and B,

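$$\text{Cost} = \frac{C_1 t z^2}{2x} + \frac{C_2 t (x-z)^2}{2x} \quad \text{if } x > z$$

$$\text{Cost} = C_1 t\left(z - \frac{x}{2}\right) \quad \text{if } x \le z$$

The expected cost will not be the same for both A and B, since the
probability distribution of x differs for the two strip types. Thus

$$F(z)\ \text{for A} = C_1 t \sum_{x=0}^{z}\left(z - \frac{x}{2}\right) A_t(x) + \frac{C_1 t z^2}{2}\sum_{x=z+1}^{\infty}\frac{A_t(x)}{x} + \frac{C_2 t}{2}\sum_{x=z+1}^{\infty}\frac{(x-z)^2 A_t(x)}{x}$$

$$F(z)\ \text{for B} = C_1 t \sum_{x=0}^{z}\left(z - \frac{x}{2}\right) B_t(x) + \frac{C_1 t z^2}{2}\sum_{x=z+1}^{\infty}\frac{B_t(x)}{x} + \frac{C_2 t}{2}\sum_{x=z+1}^{\infty}\frac{(x-z)^2 B_t(x)}{x}$$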


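We apply the criterion of Appendix I, paragraph 6, and find that
we may difference both functions under the summation sign. The
results are:

$$
\begin{aligned}
\Delta F(z)\ \text{for A} &= (C_1 + C_2)\,t\left[\sum_{x=0}^{z} A_t(x) + \left(z + \tfrac{1}{2}\right)\sum_{x=z+1}^{\infty}\frac{A_t(x)}{x}\right] - C_2 t \\
\Delta F(z)\ \text{for B} &= (C_1 + C_2)\,t\left[\sum_{x=0}^{z} B_t(x) + \left(z + \tfrac{1}{2}\right)\sum_{x=z+1}^{\infty}\frac{B_t(x)}{x}\right] - C_2 t
\end{aligned}
\tag{4.8}
$$

The criterion for a z that minimizes the cost for product A, which is

$$\Delta F(z-1)\ \text{for A} < 0 < \Delta F(z)\ \text{for A}$$

may be restated as: Choose the least value of z that satisfies

$$\sum_{x=0}^{z} A_t(x) + \left(z + \tfrac{1}{2}\right)\sum_{x=z+1}^{\infty}\frac{A_t(x)}{x} > \frac{C_2}{C_1 + C_2}$$

A similar expression, with $A_t(x)$ replaced by $B_t(x)$, gives the criterion
for a z which minimizes the cost for product B.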
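
The restated criterion lends itself to a direct search. The following Python sketch is an illustration under stated assumptions: exact Poisson probabilities are used (rather than rounded two-decimal values), the infinite sums are truncated at a hypothetical cutoff `x_max`, and the function names are invented for the example. Stock levels are in hundreds of feet.

```python
import math


def poisson(mean, x):
    """Poisson probability of a demand x when the mean demand is `mean`."""
    return math.exp(-mean) * mean ** x / math.factorial(x)


def optimal_stock(mean, c1, c2, x_max=100):
    """Least z with sum_{x<=z} P(x) + (z + 1/2) sum_{x>z} P(x)/x > c2/(c1 + c2)."""
    p = [poisson(mean, x) for x in range(x_max + 1)]
    target = c2 / (c1 + c2)
    z = 0
    while True:
        head = sum(p[: z + 1])
        tail = sum(p[x] / x for x in range(z + 1, x_max + 1))
        if head + (z + 0.5) * tail > target:
            return z
        z += 1


# Mean demand in an interval of t weeks is (weekly mean) * t.
for t in (1, 2, 3):
    print(t, optimal_stock(3 * t, 0.25, 8), optimal_stock(4 * t, 0.25, 8))
```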
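To obtain the expected cost associated with a particular z, we use

$$F(z) = F(0) + \sum_{u=0}^{z-1} \Delta F(u)$$

Since

$$F(0)\ \text{for A} = \frac{C_2 t}{2}\sum_{x=1}^{\infty} x\, A_t(x) = \frac{C_2 t}{2}\,(\text{mean demand for A}) = \frac{C_2 t}{2}(\lambda_1 t)$$

we have

$$F(z)\ \text{for A} = \frac{C_2 \lambda_1 t^2}{2} + \sum_{u=0}^{z-1}\left[\Delta F(u)\ \text{for A}\right] \tag{4.9}$$

Similarly,

$$F(z)\ \text{for B} = \frac{C_2 \lambda_2 t^2}{2} + \sum_{u=0}^{z-1}\left[\Delta F(u)\ \text{for B}\right] \tag{4.9}$$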
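
Formulas (4.8) and (4.9) can be checked against a direct evaluation of the expected cost. The Python sketch below is illustrative only: it uses exact Poisson probabilities, truncates the sums at a hypothetical cutoff `x_max`, and the function names are assumptions. With exact probabilities the cost for strip A at z = 5 and t = 1 comes out near 1.03, close to the value obtained from rounded tables.

```python
import math


def poisson(mean, x):
    return math.exp(-mean) * mean ** x / math.factorial(x)


def delta_f(z, lam, t, c1, c2, x_max=100):
    """First difference of the expected cost, equation (4.8)."""
    m = lam * t
    head = sum(poisson(m, x) for x in range(z + 1))
    tail = sum(poisson(m, x) / x for x in range(z + 1, x_max + 1))
    return (c1 + c2) * t * (head + (z + 0.5) * tail) - c2 * t


def cost_from_differences(z, lam, t, c1, c2):
    """Equation (4.9): F(0) = c2 * lam * t**2 / 2 plus the first differences."""
    return c2 * lam * t ** 2 / 2 + sum(delta_f(u, lam, t, c1, c2) for u in range(z))


def cost_direct(z, lam, t, c1, c2, x_max=100):
    """Expected cost summed directly over the two inventory situations."""
    m = lam * t
    total = 0.0
    for x in range(x_max + 1):
        px = poisson(m, x)
        if x <= z:
            total += c1 * t * (z - x / 2) * px
        else:
            total += (c1 * t * z ** 2 + c2 * t * (x - z) ** 2) / (2 * x) * px
    return total


print(cost_from_differences(5, 3, 1, 0.25, 8), cost_direct(5, 3, 1, 0.25, 8))
```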


The calculations are best tabulated. The tabulation for strip A with
t = 1 is shown in Table 2, and the results for the remaining cases are
presented in Table 3.


TABLE 2. COSTS (EXCLUDING SETUP) FOR STRIP A: PRODUCTION
SCHEDULED EVERY WEEK

                          Sum of A(i),   Sum of A(i)/i,    Sum of A(i), i = 0 to z, plus
z, x    A(x)    A(x)/x    i = 0 to z     i = z+1 upward    (z + 1/2) times sum of A(i)/i, i = z+1 upward
0       0.05    ∞         0.05           0.4114            0.2557
1       0.15    0.15      0.20           0.2614            0.5921
2       0.23    0.115     0.43           0.1464            0.7960
3       0.22    0.0714    0.65           0.075             0.9125
4       0.17    0.0425    0.82           0.0325            0.9663
5       0.10    0.02      0.92           0.0125            0.9888  <-
6       0.05    0.0083    0.97           0.0042            0.9973
7       0.02    0.0029    0.99           0.0013            0.9998
8       0.01    0.0013    1.00           0                 1.0000

Since $C_2/(C_1 + C_2) = 0.97$ here, the value of z that minimizes cost
if production is scheduled every week is 500 feet. Thus the minimum cost
(excluding setups) per week is F(5), which on evaluation turns out to
be 1.06. In a similar manner, Table 3 can be derived:


TABLE 3

Interval between    Optimum stock    Cost (excluding    Cost (including     Total costs per
production runs,    level z, ft.     setup) per run     setup) per run,     week, A and B
weeks               A       B        A       B          A and B together    together

1                   500     600      1.06    1.32       11.38               11.38
2                   800     1000     3.36    3.43       15.79               7.90
3                   1100    1400     6.69    7.94       23.63               7.88


It is seen that the total cost per week is a function of the interval
between production runs. We now wish to determine the interval that
minimizes this function. A fairly simple method of finding a numerical
result is to fit a curve to the values of the function we already know and
to interpolate.

Let h(t) be the minimum total costs per week when setups are at
intervals of t weeks. We have h(1) = 11.38, h(2) = 7.90, h(3) = 7.88.
Now, as the interval t between setups decreases, the cost of setups per
week will increase; so it is reasonable to assume that h(t) tends to
infinity as t tends to zero.

The simplest curve that will pass through three given points and
approach infinity at the origin is h(t) = A/t + B + Ct (see Figure 4.7). By
equating coefficients, we find

A = 10.4
B = -0.74
C = 1.72

$$\frac{dh}{dt} = -\frac{A}{t^2} + C = 0$$

for a minimum. Therefore

$$t^2 = \frac{A}{C} = \frac{10.4}{1.72} = 6.04, \qquad t = 2.5$$

Figure 4.7. Choice of interval t between setups.

Thus the minimum costs would be achieved by scheduling every two and
one-half weeks. By a straight-line interpolation in Table 3, we find that
950 feet of A and 1200 feet of B should be scheduled.
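
The curve fit can be reproduced by solving the three linear equations for A, B, and C. The Python sketch below uses plain Gauss-Jordan elimination; the function name `fit_cost_curve` is a hypothetical choice. Solving the equations exactly gives coefficients of about 10.38, -0.71, and 1.71, marginally different from the rounded values quoted above, and the same best interval of about two and one-half weeks.

```python
import math


def fit_cost_curve(points):
    """Solve A/t + B + C*t = h for three (t, h) points by Gauss-Jordan elimination."""
    rows = [[1.0 / t, 1.0, float(t), h] for t, h in points]
    n = 3
    for i in range(n):
        # Partial pivoting: bring the largest remaining entry of column i to row i.
        pivot = max(range(i, n), key=lambda r: abs(rows[r][i]))
        rows[i], rows[pivot] = rows[pivot], rows[i]
        for r in range(n):
            if r != i:
                factor = rows[r][i] / rows[i][i]
                rows[r] = [a - factor * b for a, b in zip(rows[r], rows[i])]
    return tuple(rows[i][3] / rows[i][i] for i in range(n))


A, B, C = fit_cost_curve([(1, 11.38), (2, 7.90), (3, 7.88)])
t_best = math.sqrt(A / C)  # from dh/dt = -A/t**2 + C = 0
print(round(A, 2), round(B, 2), round(C, 2), round(t_best, 2))
```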


Example 6 

The situation is the same as for Example 5, except that the inventory 
levels of the 6-inch strip type are to be reviewed every week. Each 
week a decision will be made whether or not to produce. If production 
is to take place, there will be a second decision on the amount. Give 
decision rules that will minimize expected costs. 


SOLUTION
Since inventory is being reviewed every week, it is clear that, once
we decide to produce the 6-inch strip, we should bring the inventory of
strip A and strip B up to the levels that minimize expected costs to be
incurred before the next weekly review. The optimum stock levels,
when production runs are made at weekly intervals, are listed in Table
3 of the previous example; they are 500 feet for strip A and 600
feet for strip B.

If we decide to produce 6-inch strip, the expected cost during the 
week will be the sum of 


(a) the cost for A associated with inventory and shortage when the 
week starts with an optimal stock level. From Table 3, this cost is 1.06. 

(b) the corresponding cost for strip B. From Table 3, this cost is
1.32.

(c) the setup cost of 9.00.

or a total of 11.38 in all.


If we decide not to produce 6-inch strip, we incur expected costs equal 
to the sum of 


(a) the cost for A associated with inventory and shortage when the 
week starts with the existing stock level. 

(b) the cost for B associated with inventory and shortage when the 
week starts with the existing stock level. 


Let y be the existing stock level of A, and z be the existing stock level
of B. Denoting the expected cost for A associated with an existing stock
level of y units by α(y), and the expected cost for B by β(z), then we
will decide to produce if and only if

α(y) + β(z) > 11.38

In order to use this decision rule, we should have on hand the values
α(u) and β(u) for the various possible values of u. These values are
computed most easily by use of formulas (4.9) of the previous example,
where the various first differences are obtained from formulas (4.8) and
Table 2. The results are:

u (hundreds of feet)    α(u)     β(u)

0                       12.00    16.00
1                       6.11     9.34
2                       2.99     5.22
3                       1.56     2.89
4                       1.09     1.76
5                       1.06     1.35
6                                1.32


Thus, if our existing stock level was zero for strip A and 100 feet for
strip B, the decision would be: Produce 6-inch strip this week (12.00 +
9.34 > 11.38). But, if we had on hand 300 feet of strip A and 100 feet
of strip B, the decision would be: Do not produce 6-inch strip this week
(1.56 + 9.34 = 10.90 < 11.38).
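
The weekly decision rule amounts to a table lookup and a comparison. A minimal Python sketch, with stock levels expressed in hundreds of feet and the tabulated costs entered by hand (the names `ALPHA`, `BETA`, and `should_produce` are illustrative):

```python
# Expected weekly costs when no production takes place, by starting stock
# (in hundreds of feet), as tabulated for strip A and strip B.
ALPHA = {0: 12.00, 1: 6.11, 2: 2.99, 3: 1.56, 4: 1.09, 5: 1.06}
BETA = {0: 16.00, 1: 9.34, 2: 5.22, 3: 2.89, 4: 1.76, 5: 1.35, 6: 1.32}

# Expected weekly cost if we do produce: 1.06 + 1.32 + 9.00 (setup).
COST_IF_PRODUCING = 11.38


def should_produce(stock_a, stock_b):
    """Produce if and only if carrying on without a run is expected to cost more."""
    return ALPHA[stock_a] + BETA[stock_b] > COST_IF_PRODUCING


print(should_produce(0, 1))  # no strip A, 100 feet of strip B
print(should_produce(3, 1))  # 300 feet of strip A, 100 feet of strip B
```

The lookup only covers whole multiples of 100 feet; intermediate stock levels would need interpolation.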

When the stock levels are not integral multiples of 100 feet,
application of the decision rule requires some form of interpolation. A
convenient way of making the decision rule "continuous" is by a diagram such
as that shown in Figure 4.8. Definite decisions are available for the
80 points whose positions are indicated by zeros and crosses. A cross
means "Produce" and a zero "Do not produce." The solid dots are
obtained by direct interpolation from the table, and the curve drawn
through these solid dots divides the yz plane into the two areas
"Produce" and "Do not produce."

Figure 4.8. Decision rule without discreteness restriction. (Axes: stocks of A, feet, against stocks of B, feet; × Produce, ○ Do not produce.)

It could happen that, on starting an inventory policy of this type, 
initial stocks exceeded the optimum levels. In this case stocks could be 
reduced to the optimum levels before the scheme started to operate. 
Once stocks were below optimum levels, they would never again rise 
above them. 


It is obvious that in Examples 5 and 6 we are discussing essentially 
the same situation. The only difference is in the assumptions made 
about the way the decision-making process will be organized. Although 
it is not the purpose of this book to discuss the broader aspects of 
analysis of management decision processes, the reader is reminded that, 
in a real situation, much thought might be required before deciding
which model most closely approximated to the actual organization. It
often happens that it is necessary to analyze two or more models, and 
to discuss their implications with management before obtaining an 
answer to a real problem. The student is cautioned that there is no 
such thing as the correct solution to a practical problem; even in text- 
book discussions there may well be more than one approach which 
yields a sensible result. 


PROBABILISTIC DEMAND WITH 
PRODUCTION LEAD TIME 


In this section we consider the effect of an appreciable lead time 
between the decision to produce (or order) for inventory and the 
arrival of goods into inventory. The main difference between the 
calculations of this and the previous section is that here we have to 
consider the depletion of inventory between the date at which a decision 
is made and the start of the period for which costs will be affected by 
the decision. 


Example 7 

An airline runs a school for air hostesses each month; it takes two
months to assemble a group of girls and to train them. Past records
of turnover in hostesses show that the probability of requiring x new
trained hostesses in any one month is g(x) [x = 0, 1, 2, ...], and the
probability of requiring y new hostesses in any two-month period is
h(y). In the event that a trained hostess is not required for flying
duties, the airline still has to pay her salary at the rate of $C_1$ per month.
If insufficient hostesses are available, there is a cost of $C_2$ per girl short
per month. Show how to determine decision rules for the size of
classes.


SOLUTION

Suppose that it is the end of a month and the personnel manager has 
to decide how many new girls should be trained so that they become 
available for flying duties in two months’ time. He will have the follow- 
ing information on which to make his decision: 


(a) the number of girls surplus to requirements, as at the end of the
month (possibly negative).

(b) the number of girls presently in training who will be available
for flying duties on the first of the following month.

(c) the number of girls presently in training who will be available
for flying duties on the first of the second month hence.

Let z = the sum of (a), (b), and (c) plus the number of girls he
schedules to start training. Then, if there were no staff losses, z girls
would be surplus to requirements in two months' time. The personnel
manager has to make a decision about the size of z which will minimize,
in some sense, the expected cost of surplus staff (including negative
surplus, or shortage).

His decision cannot affect the situation in the next two months,
because the existing surplus and trainees are not affected by his
decision, and none of the new trainees can be surplus in the next two
months. The new trainees will affect the situation in the third month,
but the costs during the fourth month will depend on the decision to be
made a month hence. Thus the present decision should be aimed at
minimizing costs during the third month.

Figure 4.9. Inventory situation for Example 7. (Panels: A. 0 ≤ y ≤ z, x ≤ z - y; B. 0 ≤ y ≤ z, x > z - y; C. y > z.)

Figure 4.9 shows possible inventory situations during the third
month. In that figure, x represents personnel losses during the third
month, y represents personnel losses in the first two months, and z is
the decision variable as defined above. There are three possible
situations, depending on the relative values of x, y, and z. In all three
cases the cross-hatched area represents "inventory months" and the
dotted area represents "shortage months." We are assuming that
demand for new girls arises uniformly during the month.

In situation A, the third month starts with an inventory z - y, and
ends with an inventory z - y - x, so that the total (actual) cost is

$$\frac{C_1}{2}\left[(z-y) + (z-y-x)\right] = C_1\left(z - y - \frac{x}{2}\right)$$

In situation B, the third month starts with an inventory z - y and ends
with a shortage x + y - z. As in Example 5, the area representing
inventory cost is $(1/2x)(z-y)^2$, and the area representing shortage
cost is $(1/2x)(x+y-z)^2$. The total cost is

$$\frac{1}{2x}\left[C_1(z-y)^2 + C_2(x+y-z)^2\right]$$

In situation C, the third month starts with a shortage y - z and ends
with a shortage x + y - z, so that the cost is

$$\frac{C_2}{2}\left[(y-z) + (x+y-z)\right] = C_2\left(\frac{x}{2} + y - z\right)$$

We now multiply by the joint probability of demands for y girls
during the lead time (first two months), and x girls during the third
month, and sum over the range of x and y. Assuming that the random
variables x and y are independent, the joint probability density function
is g(x) h(y).

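Let F(z) denote the total expected cost corresponding to a particular
z. Then

$$
\begin{aligned}
F(z) ={}& \sum_{y=0}^{z}\sum_{x=0}^{z-y} C_1\left(z - y - \frac{x}{2}\right) g(x)\,h(y) \\
&+ \sum_{y=0}^{z}\sum_{x=z-y+1}^{\infty} \frac{1}{2x}\left[C_1(z-y)^2 + C_2(x+y-z)^2\right] g(x)\,h(y) \\
&+ \sum_{y=z+1}^{\infty}\sum_{x=0}^{\infty} C_2\left(\frac{x}{2} + y - z\right) g(x)\,h(y)
\end{aligned}
$$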
In order to find the value of z that minimizes F(z), we require ΔF(z).
The first difference of this function F(z) is worked out in Appendix I,
paragraph 6, as an illustrative example. The result given there, with
slight changes in notation and form, is

$$
\begin{aligned}
\Delta F(z) ={}& \sum_{y=0}^{z}\sum_{x=0}^{z-y} C_1\, g(x)\,h(y) \\
&+ \sum_{y=0}^{z}\sum_{x=z-y+1}^{\infty} \frac{1}{2x}\left[C_1(2z - 2y + 1) + C_2(2z - 2x - 2y + 1)\right] g(x)\,h(y) \\
&- \sum_{y=z+1}^{\infty}\sum_{x=0}^{\infty} C_2\, g(x)\,h(y)
\end{aligned}
$$

This result can be simplified considerably by the use of the relations

$$\sum_{x=0}^{\infty} g(x) = 1, \qquad \sum_{y=0}^{\infty} h(y) = 1$$

which hold because g(x) and h(y) are probability densities. After
simplification, ΔF(z) has the form

$$\Delta F(z) = (C_1 + C_2)\sum_{y=0}^{z} h(y)\left[\sum_{x=0}^{z-y} g(x) + \left(z - y + \tfrac{1}{2}\right)\sum_{x=z-y+1}^{\infty}\frac{g(x)}{x}\right] - C_2$$

The minimum value of F(z) will be given by the smallest value of z
for which ΔF(z) > 0: i.e. by the smallest value of z for which

$$\sum_{y=0}^{z} h(y)\left[\sum_{x=0}^{z-y} g(x) + \left(z - y + \tfrac{1}{2}\right)\sum_{x=z-y+1}^{\infty}\frac{g(x)}{x}\right] > \frac{C_2}{C_1 + C_2}$$


The number of girls to be called for training is then this value of z
minus the sum of the quantities (a), (b), and (c) defined in the first
paragraph of the solution.
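
The decision rule can be illustrated numerically. The sketch below is not part of the original solution: the monthly turnover distribution, the two cost figures, and all function names are hypothetical, and the two-month distribution is obtained by assuming that consecutive months are independent. It finds z from the criterion and confirms the result against a brute-force minimization of the expected cost over the three situations A, B, and C.

```python
def convolve(p, q):
    """Distribution of the sum of two independent discrete variables."""
    out = [0.0] * (len(p) + len(q) - 1)
    for m, pm in enumerate(p):
        for k, qk in enumerate(q):
            out[m + k] += pm * qk
    return out


def class_target(g, h, c1, c2):
    """Smallest z satisfying the criterion obtained from the first difference."""
    target = c2 / (c1 + c2)
    z = 0
    while True:
        total = 0.0
        for y in range(min(z, len(h) - 1) + 1):
            w = z - y                                   # stock at start of third month
            head = sum(g[: w + 1])
            tail = sum(g[x] / x for x in range(w + 1, len(g)))
            total += h[y] * (head + (w + 0.5) * tail)
        if total > target:
            return z
        z += 1


def expected_cost(z, g, h, c1, c2):
    """Expected third-month cost, summed over situations A, B and C."""
    total = 0.0
    for y, hy in enumerate(h):
        for x, gx in enumerate(g):
            start = z - y
            if start >= x:                              # situation A: never short
                cost = c1 * (start - x / 2)
            elif start >= 0:                            # situation B: runs out (x >= 1 here)
                cost = (c1 * start ** 2 + c2 * (x - start) ** 2) / (2 * x)
            else:                                       # situation C: short all month
                cost = c2 * (x / 2 - start)
            total += cost * gx * hy
    return total


g = [0.05, 0.10, 0.20, 0.30, 0.20, 0.15]  # hypothetical monthly requirement, 0..5
h = convolve(g, g)                        # two-month requirement, assuming independence
z_rule = class_target(g, h, c1=300, c2=1000)
z_brute = min(range(20), key=lambda z: expected_cost(z, g, h, 300, 1000))
print(z_rule, z_brute)
```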


Example 8 

A dealer in industrial solvents has one product which is made 
especially for him. He places orders for supplies every month and 
receives delivery one month later. In the event that his tank capacity 
is insufficient to hold all the quantity ordered at the time of delivery, 
the balance of the order is wasted at a cost of k per gallon wasted. 

Customer demand for one month can be represented by the probability
density function p(x); and demand in one month is independent
of demand in any other month. The cost of storing the solvent is $C_1$ per
gallon per month, and the cost of a shortage is $C_2$ per gallon per month.
In addition, there is a fixed cost per month of KZ, where Z is his total
storage capacity.

If the criterion is minimum expected total cost, discuss the determi- 
nation of 


(a) the amount to be ordered each month with present storage 
capacity. 
(b) the best value for total storage capacity. 


SOLUTION
Let Z be the storage capacity, and z the amount in the tanks plus the
amount on order immediately after an order is placed.
Suppose that there is a demand y in the first month following the
order and a demand x in the second month. As in the airline hostess
problem the decision about z will affect costs only in the second month
following the order.

We have to consider the following possibilities. 


(1) z - y > Z. In this case an amount z - y - Z will be wasted, and
the second month will start with Z in storage.

(2) 0 ≤ z - y ≤ Z. In this case there will be no waste, and the
second month will start with z - y in storage.

(3) z - y < 0. In this case there will be no waste, but the second
month will start with a shortage left from the previous month.


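The costs of inventory and shortage can be derived by reasoning
very similar to that used in the airline hostess problem. In case 1 the
costs are

$$C_1\left(Z - \frac{x}{2}\right) + k(z - y - Z) \quad \text{if } x \le Z$$

$$\frac{1}{2x}\left[C_1 Z^2 + C_2(x - Z)^2\right] + k(z - y - Z) \quad \text{if } x > Z$$

In case 2 the costs are

$$C_1\left(z - y - \frac{x}{2}\right) \quad \text{if } x \le z - y$$

$$\frac{1}{2x}\left[C_1(z-y)^2 + C_2(x + y - z)^2\right] \quad \text{if } x > z - y$$

In case 3 the costs are

$$C_2\left(\frac{x}{2} + y - z\right)$$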

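In order to obtain the total expected costs, we multiply by the
appropriate probability density functions and integrate with respect to
x and y. We must also add the fixed storage cost KZ.
We will use F(z, Z) to denote the result.

$$
\begin{aligned}
F(z, Z) ={}& \int_{y=0}^{z-Z}\int_{x=0}^{Z} p(x)\,p(y)\left[C_1\left(Z - \frac{x}{2}\right) + k(z - y - Z)\right] dx\,dy \\
&+ \int_{y=0}^{z-Z}\int_{x=Z}^{\infty} p(x)\,p(y)\left\{\frac{1}{2x}\left[C_1 Z^2 + C_2(x - Z)^2\right] + k(z - y - Z)\right\} dx\,dy \\
&+ \int_{y=z-Z}^{z}\int_{x=0}^{z-y} p(x)\,p(y)\left[C_1\left(z - y - \frac{x}{2}\right)\right] dx\,dy \\
&+ \int_{y=z-Z}^{z}\int_{x=z-y}^{\infty} p(x)\,p(y)\left\{\frac{1}{2x}\left[C_1(z-y)^2 + C_2(x + y - z)^2\right]\right\} dx\,dy \\
&+ \int_{y=z}^{\infty}\int_{x=0}^{\infty} p(x)\,p(y)\left[C_2\left(\frac{x}{2} + y - z\right)\right] dx\,dy \\
&+ KZ
\end{aligned}
$$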


If the storage capacity Z is held at its present level, F(z, Z) is a 
function only of the quantity z. Call this function G(z). In order to 
minimize this function, we require the derivative dG/dz. 

Appendix II discusses differentiation under the integral sign.
Application of equations (II.6), (II.7), and (II.8) of Appendix II and some
manipulation yields

$$\frac{dG}{dz} = (C_2 + k)\,A(z - Z) - C_2 + (C_1 + C_2)\int_{z-Z}^{z} p(y)\left[A(z - y) + B(z - y)\right] dy \tag{4.10}$$

where we define

$$A(\xi) = \int_{0}^{\xi} p(x)\,dx, \qquad B(\xi) = \xi\int_{\xi}^{\infty} \frac{p(x)}{x}\,dx$$

In order to obtain the optimal value of z, we set the right-hand side of
(4.10) equal to zero and attempt to solve the resulting equation for z.
Trial-and-error methods using numerical integration would give usable
results for the decision variable z.

If the storage capacity Z is also considered to be a decision variable,
F(z, Z) is a function of two variables z and Z. After differentiating
F(z, Z) and simplifying as above, we end up with two equations to be
solved numerically for z and Z:

$$\frac{\partial F}{\partial z} = (C_2 + k)\,A(z - Z) - C_2 + (C_1 + C_2)\int_{z-Z}^{z} p(y)\left[A(z - y) + B(z - y)\right] dy = 0$$

$$\frac{\partial F}{\partial Z} = K + \left\{(C_1 + C_2)\left[A(Z) + B(Z)\right] - (C_2 + k)\right\} A(z - Z) = 0$$
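
A trial-and-error solution of this kind is easy to mechanize. The sketch below is an illustration under several assumptions that are not in the text: demand is taken to be uniform between two hypothetical limits (so that the integrals of p(x) and of p(x)/x have closed forms), the remaining integral is evaluated by the midpoint rule, and the root of dG/dz = 0 for fixed Z is located by bisection. All numerical values and function names are invented for the example.

```python
import math

# Hypothetical monthly demand: uniform between LOW and HIGH gallons.
LOW, HIGH = 100.0, 300.0


def p(x):
    """Demand density."""
    return 1.0 / (HIGH - LOW) if LOW <= x <= HIGH else 0.0


def A(xi):
    """Integral of p(x) from 0 to xi."""
    return min(max((xi - LOW) / (HIGH - LOW), 0.0), 1.0)


def B(xi):
    """xi times the integral of p(x)/x from xi to infinity."""
    if xi <= 0.0 or xi >= HIGH:
        return 0.0
    return xi * math.log(HIGH / max(xi, LOW)) / (HIGH - LOW)


def dG_dz(z, Z, c1, c2, k, panels=2000):
    """Derivative of expected cost with respect to z; integral by the midpoint rule."""
    step = Z / panels
    integral = 0.0
    for i in range(panels):
        y = z - Z + (i + 0.5) * step
        integral += p(y) * (A(z - y) + B(z - y)) * step
    return (c2 + k) * A(z - Z) - c2 + (c1 + c2) * integral


def solve_order_level(Z, c1, c2, k, lo=0.0, hi=2000.0, tol=1e-3):
    """Bisection on dG/dz, which is negative at lo and positive at hi."""
    while hi - lo > tol:
        mid = (lo + hi) / 2
        if dG_dz(mid, Z, c1, c2, k) < 0:
            lo = mid
        else:
            hi = mid
    return (lo + hi) / 2


z_star = solve_order_level(Z=400.0, c1=1.0, c2=10.0, k=2.0)
print(round(z_star, 1))
```

The example takes the waste cost k to be at least as large as the storage cost, which keeps the derivative increasing in z so that bisection finds the single root. Treating Z as a second decision variable would require solving the two equations simultaneously, for instance by repeating this search over a grid of Z values.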


PROBLEMS FOR SOLUTION 
Set VI 


1. A baking company sells cake by the pound. It makes a profit of 50 cents
a pound on every pound sold on the day it is baked. It disposes of all cake 
not sold on the date it is baked at a loss of 12 cents a pound. If demand is 
known to be rectangular between 2000 and 3000 pounds, determine the optimal 
daily amount baked. [Ans. 2807 pounds] 

2. The probability distribution of monthly sales of a certain item is as 
follows: 

Monthly sales Probability 

0.02 
0.05 


Qanrwnre Oo 
eeces 
Ssesyxge 


1.00 


The cost of carrying inventory is $10 per unit per month. The current policy
is to maintain a stock of four items at the beginning of each month. Assuming
that the cost of shortage is proportional to both time and quantity short,
obtain the imputed cost of a shortage of one item for one time unit. (Because
the problem is stated in discrete units, the answer will consist of a range of
values for the imputed cost.)

3. A newspaper boy buys papers for 3 cents each and sells them for 7 cents
each. He cannot return unsold newspapers. Daily demand has the following
distribution:


No. of customers   23   24   25   26   27   28   29   30   31   32
Probability        .01  .03  .06  .10  .20  .25  .15  .10  .05  .05


If each day's demand is independent of the previous day's, how many papers
should he order each day?


4. A manufacturer of a perishable commodity is limited, because he takes 
advantage of bulk transportation, to one shipment a month from factory to 
warehouse. This shipment takes place on the first of each month, and includes 
all the previous month’s production. There is one production run a month, 
which starts at the beginning of the month and continues at the rate of k units 
per day until the monthly production quota of q units is finished. (Here k is
fixed, and q is a decision variable.) Production costs C per unit. 

Goods are sold only from the warehouse. The probability of a demand on 
the warehouse for a quantity n during the month is p(n); it may be assumed 
that the demand on the warehouse is spread uniformly over the month.
Storage, either at the warehouse or at the factory, costs $C_1$ per unit per month.
Any warehouse surplus at the end of the month is considered stale, inasmuch
as it is over a month old, and is disposed of through outlets which will take an 
unlimited amount of stale goods, but at a price P’ (less than C) per unit. The 
sales price of fresh goods (i.e. goods received at the warehouse on the first of 
the current month) is P per unit. 

Find the expected profit per month as a function of the monthly production 
quota q, and, by differencing this function, determine the condition for optimal 
q. 


5. Discuss the problem of the automobile dealer (Example 4 of the text) if 
the assumption of immediate delivery from the factory is replaced by an as- 
sumption that delivery takes place two weeks after the dealer places an order. 


6. A company owns two warehouses X and Y. Shipments from the factory
are made on the first of each month to bring the stock at X up to $S_1$ and the
stock at Y up to $S_2$. Demand during the month is uniform at each warehouse
with a density function f(x) at X and g(y) at Y, and with x and y independent.
Inventory holding costs are $C_1$ per unit per month and shortage costs are $C_2$
per unit per month, at either warehouse. In the event of a shortage at one
warehouse, deliveries are made from the other warehouse, but only provided
the latter is certain that its own orders for the month can be taken care of out
of the stock that remains. However, it costs an extra amount k per unit to
make a delivery from X to a customer of Y, and vice versa. It may be assumed
that, by the time one warehouse has run out of stock, the other warehouse
can predict its total demand for the month exactly.
Show how to find values of $S_1$ and $S_2$ which will minimize the total expected
cost.


[Hint: The achieved total cost is a function of the realized values of x and y.
The form of the function is different in different areas of the xy plane. There
are six such areas, viz.:


A: 0 ≤ x ≤ S1,   0 ≤ y ≤ S2
B: x > S1,       0 ≤ y ≤ S1 + S2 - x
C: y ≥ S2,       0 ≤ x ≤ S1 + S2 - y
D: 0 ≤ x ≤ S1,   y > S1 + S2 - x
E: x > S1,       y ≥ S2
F: 0 ≤ y ≤ S2,   x > S1 + S2 - y


Find the functional form in each area, multiply by the joint density function
of x and y, and integrate to obtain the total expected cost as a function of
$S_1$ and $S_2$.]


7. This problem is identical with problem 6, except that the trans-shipment
assumption is revised to read as follows: In the event of a shortage at one
warehouse, deliveries are made from the other warehouse until such time as
both warehouses are out of stock.


CHAPTER 5


Replacement


Replacement theory is concerned with situations in which efficiency 
tends to worsen in time, and in which it can be restored to a previous 
level by some kind of remedial action. The problem is to determine 
the times at which such remedial action should be taken, in order to 
minimize some appropriate measure of effectiveness. 

In this chapter we consider in turn the replacement of capital equip- 
ment that deteriorates with time, group replacement of items such as 
light bulbs which fail completely, some problems in mortality and 
staffing, and finally some examples of replacement situations which do 
not fit into these three categories. 


CAPITAL EQUIPMENT THAT 
DETERIORATES WITH TIME 


A typical example of such replacement situations is the problem of 
when a fleet owner should replace a truck. A truck wears out with age, 
and the problem is to discover the “best” time at which to purchase a 
replacement. Before we attempt to analyze such situations, let us first 
consider the truck owner and find out why he possesses a truck. We 
may suppose that one reason is to transport goods as cheaply as pos- 
sible, but there may be other reasons. Thus the owner might feel that 
a new truck lends prestige to his business. 

If we concentrate on the cheapness of transportation, we could begin 
by examining the way in which costs increase with the age of the truck. 
With a fixed mileage every month we might obtain a graph as in Figure 
5.1. 

In addition to running costs there are the capital costs of purchasing 
the truck. These can be expressed as average costs per month, and the 
average monthly cost will decrease, the longer a replacement is postponed. 
However, there comes a point at which the rate of increase of 
running costs more than offsets the savings in average capital costs. 
At this point replacement is justified. 


Figure 5.1. Monthly running costs for a truck. (Vertical axis: running cost 
per month; horizontal axis: age in months; annotations on the curve: high 
repair expense, much lost time in shops.) 


Example 1 
A fleet owner finds from his past records that the costs per year of 
running a truck whose purchase price is $6000 are as given below: 


Year 1 2 3 4 5 6 7 8 
Running costs 1000 1200 1400 1800 2300 2800 3400 4000 
Resale price 3000 1500 750 375 200 200 200 200 


At what age is a replacement due? 


SOLUTION 

We first compute a table (Table 1) showing the average cost per year 
during the life of the truck. The third column of the table gives the 
difference between the purchase price and the resale price at the end of 
the year in question. 

We conclude that the truck should be replaced at the end of the fifth 
year. If the truck is not replaced at this time, the average cost per 
year will start to increase. 


TABLE 1. COSTS OF OWNING AND OPERATING A TRUCK 

Replace at end   Total running   Total capital   Total    Average cost 
of year          costs           costs           costs    per year 
      1            1,000           3,000          4,000     4,000 
      2            2,200           4,500          6,700     3,350 
      3            3,600           5,250          8,850     2,950 
      4            5,400           5,625         11,025     2,756 
      5            7,700           5,800         13,500     2,700 
      6           10,500           5,800         16,300     2,717 


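The construction of Table 1 can be sketched in a few lines of Python. This is only an illustration of the arithmetic described in the solution; the function and variable names are our own, and all eight years of data from the example are used rather than the six shown in the table.

```python
# Average annual cost of owning and running the truck of Example 1,
# ignoring interest. The data are those given in the example.
PURCHASE_PRICE = 6000
RUNNING_COSTS = [1000, 1200, 1400, 1800, 2300, 2800, 3400, 4000]  # years 1..8
RESALE_PRICES = [3000, 1500, 750, 375, 200, 200, 200, 200]        # end of years 1..8


def average_annual_costs(price, running, resale):
    """Average cost per year if the truck is replaced at the end of each year."""
    averages = []
    total_running = 0
    for year, (cost, salvage) in enumerate(zip(running, resale), start=1):
        total_running += cost            # running costs to date
        capital = price - salvage        # purchase price less resale price
        averages.append((total_running + capital) / year)
    return averages


averages = average_annual_costs(PURCHASE_PRICE, RUNNING_COSTS, RESALE_PRICES)
best_year = min(range(len(averages)), key=averages.__getitem__) + 1

for year, avg in enumerate(averages, start=1):
    print(f"replace at end of year {year}: average cost per year {avg:8.2f}")
print("lowest average cost: replace at end of year", best_year)
```

The minimum falls at the fifth year, in agreement with the conclusion drawn from the table.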
Example 2 


The truck owner of the previous example has three trucks, two of 
which are two years old and the third one year old. He is considering 
a new type of truck with 50% more capacity than one of the old ones 
at a unit price of $8000. He estimates that the running costs and resale 
price for the new truck will be as follows: 


Year             1     2     3     4     5     6     7     8 
Running costs  1200  1500  1800  2400  3100  4000  5000  6100 
Resale price   4000  2000  1000   500   300   300   300   300 


Assuming that the loss of flexibility due to fewer trucks is of no im- 
portance, and that he will continue to have sufficient work for three of 
the old trucks, what should his policy be? 


SOLUTION 

As in Example 1, we can show that the lowest average annual cost 
can be achieved by replacing one of the larger trucks every five years. 
With such a policy the average annual cost would be $3540 per truck. 
This is equivalent to $2360 per payload of the smaller trucks. Since 
this is less than the minimum average annual cost for one of the smaller 
trucks, the latter will be replaced by the new larger trucks. 

Having decided to replace the old-type trucks with the larger trucks, 
we still must decide when the new trucks should be purchased. We 
assume that for uniformity the replacement will involve two new trucks 
and all three of the old trucks. The new trucks will be purchased when 
the cost for the next year of running the three old trucks exceeds the 
average yearly cost for two new-type trucks. 

Examination of Table 1 shows that the total yearly cost for one 
smaller truck is $4000 in the first year, and $2700, $2150, $2175, $2475, 
$2800 in years 2, 3, 4, 5, and 6, respectively. Hence the total costs next 
year for two smaller trucks aged two years and one smaller truck aged 
one year will be 

2 × 2150 + 2700 = 7000 

In subsequent years the costs will be $6500, $7125, and $8075 for years 
2, 3, and 4. The average annual cost for two larger trucks will be $7080, 
so that the costs for the old trucks will not exceed those for the larger 
trucks until the third year hence. Thus all three trucks should be 
replaced two years from now. 

Note that the old trucks will be replaced before any of them reaches 
the normal replacement age. 
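
The comparison made in this solution can be checked with a short Python sketch. It is an illustration only: the helper names are ours, and the rule coded here (keep the old trucks while their cost for the coming year does not exceed the average yearly cost of two new trucks) is simply the rule stated above.

```python
def cumulative_costs(price, running, resale):
    """Total cost of ownership up to the end of each year (no interest)."""
    totals, spent = [], 0
    for cost, salvage in zip(running, resale):
        spent += cost
        totals.append(spent + price - salvage)
    return totals


old_totals = cumulative_costs(6000, [1000, 1200, 1400, 1800, 2300, 2800],
                              [3000, 1500, 750, 375, 200, 200])
new_totals = cumulative_costs(8000, [1200, 1500, 1800, 2400, 3100, 4000],
                              [4000, 2000, 1000, 500, 300, 300])

# Cost incurred in each single year of an old truck's life.
old_yearly = [old_totals[0]] + [b - a for a, b in zip(old_totals, old_totals[1:])]

# Lowest average annual cost of one new truck, and of the two needed.
new_average = min(total / (i + 1) for i, total in enumerate(new_totals))
fleet_new = 2 * new_average

# Present ages of the three old trucks; a truck aged a incurs old_yearly[a] next year.
ages = [2, 2, 1]
years_to_wait = 0
while sum(old_yearly[age + years_to_wait] for age in ages) <= fleet_new:
    years_to_wait += 1

print("yearly costs of one old truck:", old_yearly)
print("average annual cost of two new trucks:", fleet_new)
print("replace all three old trucks after", years_to_wait, "more years")
```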


In these two examples we have made the simplest possible analysis 
of the costs involved. In practice, we usually wish to consider not only 
the amounts of expenditures but the points in time at which they occur. 
We now give an example to illustrate how this can be done. 


Example 3 

A manufacturer is offered two machines A and B. A is priced at 
$5000, and running costs are estimated at $800 for each of the first five 
years, increasing by $200 per year in the sixth and subsequent years. 
Machine B, which has the same capacity as A, costs $2500 but will have 
running costs of $1200 per year for six years, increasing by $200 per 
year thereafter. If money is worth 10% per year, which machine 
should be purchased? (Assume that the machines will eventually be 
sold for scrap at a negligible price.) 


SOLUTION 

We can regard the statement that money is worth 10% per year in 
two ways. First, we may suppose that any spare cash now available 
could be made to produce 10% return in a year, and so spending $100 
today would be equivalent to spending $110 in a year’s time. (If we 
plan to spend $110 a year from now, we could spend $100 today on an 
investment which would be worth $110 next year.) 

Alternatively, we might suppose that expenditures are made by 
borrowing money at 10% per annum. Again expenditure of $110 in a 
year’s time is equivalent to $100 today. If we spent $100 today, at the 
end of a year we would have to pay $110 to liquidate the debt. Viewed 
either way, $100 today is equivalent to $110 a year from now. 

Similarly, one dollar a year from now is equivalent to \((1.1)^{-1}\) dollars 
today, and one dollar r years from now is equivalent to \((1.1)^{-r}\) dollars 
today. This quantity \((1.1)^{-r}\) is called the present value or present worth 
of one dollar spent r years from now. 

In order to compare A and B, we have to compare the present worth 
of the expenditure on the two machines. Instead of going through the 
arithmetic twice, it will be simpler to establish an algebraic formula for 
the present worth and to substitute numerical values later. 

Let C be the purchase price, and let \(R_n\) be the running cost in year n. 
Let i = rate of interest so that v = 1/(1 + i) is the present worth of a 
unit to be spent a year hence. (v is known as the discount rate.) Suppose 
the machine is to be replaced after r years. Then assuming that 
expenditure can be considered to take place at the beginning of each 
year,* the present worth of the expenditure, which we will denote by 
P(r), is given by 


\[ P(r) = C + R_1 + vR_2 + v^{2}R_3 + v^{3}R_4 + \cdots + v^{r-1}R_r \tag{5.1} \]


Now P(r) increases as r increases; so the present worth, if we replace 
after (r+1) years, is greater than if we replace after r years. For the 
extra money we get an extra year’s service, and we wish to compare 
some function of the replacement interval which allows for this. 

In order to do this, let us suppose that replacement occurs in r years, 
and that the manufacturer sets up a fund P(r) to be invested at interest 
i, from which all expenditure on the machine will be made. Suppose 
that he obtains the sum P(r) by borrowing funds at the same rate i, and 
he repays the loan by fixed annual payments throughout the life of the 
machine. In this way the variable payments actually made can be 
translated into fixed annual payments. We can use these fixed pay- 
ments as a basis for comparing replacement at the end of different 
periods. (In the previous examples we did much the same thing when 
comparing the average costs per year, ignoring interest.) 

The present worth of fixed annual payments x for r years is 


\[ x + vx + v^{2}x + \cdots + v^{r-1}x = \frac{1 - v^{r}}{1 - v}\,x \]


Since this is to be the sum borrowed we see that 


\[ x = \frac{1 - v}{1 - v^{r}}\,P(r) \tag{5.2} \]


Thus the best period at which to replace the machine is the period r 
which minimizes \(P(r)(1 - v)/(1 - v^{r})\). We can simplify this criterion 
somewhat by noting that the function to be minimized is a positive 
constant times the function \(F(r) = P(r)/(1 - v^{r})\). The period at 
which to replace the machine is the period r which minimizes F(r). 
* If this is unrealistic, the results still follow, provided \(R_n\) is modified to mean 
expenditure during the year discounted to the beginning of the year. 

Since the possible values of r are discrete, we employ the methods of 
finite differences to calculate the optimal r. According to Appendix I, 
the condition on r for F(r) to be a minimum is 


\[ \Delta F(r-1) < 0 < \Delta F(r) \tag{5.3} \]


subject to the usual checks on whether the resulting value of r actually 
provides an absolute minimum. We next evaluate \(\Delta F(r)\): 


\begin{align*}
\Delta F(r) &= \frac{P(r+1)}{1 - v^{r+1}} - \frac{P(r)}{1 - v^{r}} \\
 &= \frac{P(r+1) - P(r) + v^{r+1}P(r) - v^{r}P(r+1)}{(1 - v^{r})(1 - v^{r+1})} \\
 &= \frac{v^{r}R_{r+1} + (v^{r+1} - v^{r})\Bigl[C + \sum_{j=1}^{r} v^{j-1}R_j\Bigr] - v^{2r}R_{r+1}}{(1 - v^{r})(1 - v^{r+1})} \\
 &= \frac{v^{r}R_{r+1}}{1 - v^{r+1}} + \frac{(v^{r+1} - v^{r})\,P(r)}{(1 - v^{r})(1 - v^{r+1})}
\end{align*}


Thus we have 


\[ \Delta F(r) = \frac{v^{r}}{1 - v^{r+1}}\left[R_{r+1} - \frac{1 - v}{1 - v^{r}}\,P(r)\right] \tag{5.4} \]


Since \(v^{r}/[1 - v^{r+1}]\) is always positive for |v| < 1, \(\Delta F(r)\) has the same 
sign as the quantity in brackets, and the condition (5.3) for the optimal 
replacement interval r becomes 

\[ \frac{1 - v^{r-1}}{1 - v}\,R_r < P(r-1) \quad\text{and}\quad \frac{1 - v^{r}}{1 - v}\,R_{r+1} > P(r) \tag{5.5} \]

Returning now to the numerical values of the problem, we tabulate 
the results for machine A (Table 2). 


TABLE 2. REPLACEMENT COSTS FOR MACHINE A 

Year r   Running cost   \(v^{r-1}\)   \(v^{r-1}R_r\)   P(r)     \(\frac{1 - v^{r}}{1 - v}R_{r+1}\) 
         \(R_r\) 
   1        800         1.0000          800          5,800        800 
   2        800         0.9091          727          6,527      1,528 
   3        800         0.8264          661          7,188      2,189 
   4        800         0.7513          601          7,789      2,790 
   5        800         0.6830          546          8,335      4,170 
   6       1000         0.6209          621          8,956      5,749 
   7       1200         0.5645          677          9,633      7,497 
   8       1400         0.5132          718         10,351      9,390 
   9       1600         0.4665          746         11,097     11,403 
  10       1800         0.4241          763         11,860 


Thus it would be best to replace machine A after nine years. In a 
similar fashion, the reader can verify that machine B is best replaced 
after eight years. The equivalent fixed annual charge, computed from 
(5.2), is $1752 for machine A and $1680 for machine B. We conclude 
that it would be better to purchase machine B. (The average of the 
actual payments is $1578 for A and $1588 for B.) 
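
The calculation for both machines can be sketched in Python as follows. This is a minimal illustration rather than the authors' procedure: instead of applying criterion (5.5) it simply evaluates the equivalent annual charge of (5.2) for every replacement interval up to an assumed horizon of 20 years and picks the smallest, which leads to the same interval. The function names and the `horizon` parameter are our own.

```python
def running_cost(year, flat_cost, flat_years, step=200):
    """Running cost in a given year: constant at first, then rising by `step` a year."""
    return flat_cost + step * max(0, year - flat_years)


def present_worth(price, costs, v):
    """P(r) of equation (5.1); costs[n-1] is paid at the beginning of year n."""
    return price + sum(cost * v ** n for n, cost in enumerate(costs))


def equivalent_annual_charge(price, costs, v):
    """Fixed annual payment x of equation (5.2) for replacement after len(costs) years."""
    r = len(costs)
    return (1 - v) / (1 - v ** r) * present_worth(price, costs, v)


def best_replacement(price, flat_cost, flat_years, interest=0.10, horizon=20):
    """Return (r, x): the interval with the lowest annual charge, and that charge."""
    v = 1 / (1 + interest)
    charges = {}
    for r in range(1, horizon + 1):
        costs = [running_cost(n, flat_cost, flat_years) for n in range(1, r + 1)]
        charges[r] = equivalent_annual_charge(price, costs, v)
    r_best = min(charges, key=charges.get)
    return r_best, charges[r_best]


r_a, x_a = best_replacement(price=5000, flat_cost=800, flat_years=5)   # machine A
r_b, x_b = best_replacement(price=2500, flat_cost=1200, flat_years=6)  # machine B

print(f"machine A: replace after {r_a} years, annual charge {x_a:.0f}")
print(f"machine B: replace after {r_b} years, annual charge {x_b:.0f}")
```

Minimizing x over r is equivalent to minimizing F(r), since the two differ only by the positive constant 1 - v.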


In practice, there is a further complication in this type of replace- 
ment problem. The tax situation of a business is considerably affected 
by its capital equipment replacement policies. A discussion of tax law 
is outside the scope of this book, but in any real problem the incidence 
of tax cannot be ignored. 

It is of interest to compare the corrected annual payment x as given 
by (5.2) with the actual annual cost, for a situation where the interest 
rate approaches zero. When the interest rate i tends to zero, the discount 
rate v tends to one. The limit of x in (5.2), when v tends to one, 
is found by l'Hôpital's rule to be P(r)/r. This is just the average annual 
cost; so the method of Examples 1 and 2 is a limiting case of the 
method of Example 3. 
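
For readers who wish to see the limit worked out, the step can be written as follows. This is our own expansion of the remark above; it uses only (5.1) and (5.2).

```latex
\[
\lim_{v \to 1} \frac{1 - v}{1 - v^{r}}
  = \lim_{v \to 1} \frac{-1}{-r\,v^{r-1}}
  = \frac{1}{r},
\qquad
\lim_{v \to 1} P(r) = C + R_1 + R_2 + \cdots + R_r ,
\]
\[
\text{so that}\qquad
\lim_{v \to 1} x = \frac{C + R_1 + R_2 + \cdots + R_r}{r}.
\]
```

The same limit follows without l'Hôpital's rule from the identity \(1 - v^{r} = (1 - v)(1 + v + \cdots + v^{r-1})\).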


ITEMS THAT FAIL COMPLETELY 


We now turn to the second type of replacement problem, where we 
are concerned with items that either work or fail completely; that is, 
loss of usefulness is sudden and complete. The principles used in 
analyzing this type of situation have been largely developed by the 
actuarial profession in connection with human lives, and we will use 
their terminology and notation as is convenient. 


Example 4 
The following mortality rates have been observed for a certain type 
of light bulb. 
Week                             1    2    3    4    5 
Percent failing by end of week  10   25   50   80  100 


There are 1000 bulbs in use, and it costs $1.00 to replace an individual 
bulb which has burnt out. If all bulbs were replaced simultaneously, 
it would cost 25 cents per bulb. It is proposed to replace all bulbs at 
fixed intervals, whether or not they have burnt out, and to continue 
replacing burnt-out bulbs as they fail. At what intervals should all the 
bulbs be replaced? 

SOLUTION 

We first compute the probability that a new bulb fails in successive 
weeks (no group replacement policy in operation). Let \(p_i\) denote the 
probability that a bulb, which was new when installed, fails during the 
ith week of its life. This probability is clearly the difference between 
the proportion “alive” at the beginning of the ith week and the proportion 
alive at the end of the ith week. We obtain the following table: 


          Probability that a new bulb 
          installed at time zero fails 
Week      during week i 
(i)       (\(p_i\)) 
 1        0.10 
 2        0.15 
 3        0.25 
 4        0.30 
 5        0.20 

Total     1.00 


Note that no installed bulb survives more than five weeks. Thus a 
bulb that has already lasted four weeks is sure to fail during the fifth 
week. However, the unconditional probability of failure during the 
fifth week, for a new bulb installed at time zero, is what we are denoting 
by \(p_5\) and has the value 0.20. 

With two simplifying assumptions, we can compute the number of 
replacements due to failure in successive weeks, under a policy of no 
group replacement. We assume 


(a) that bulbs that fail during a week are replaced just before the 
end of that week. 

(b) that the actual percentage of failures during a week for a sub- 
population of bulbs with the same age is the same as the expected 
percentage of failures during the week for that subpopulation. 


Let \(n_i\) denote the number of replacements made at the end of the ith 
week, if all 1000 bulbs are new initially. Then, under our assumptions 
we immediately obtain: 


\begin{align*}
n_0 &= 1000 \\
n_1 &= n_0p_1 = 100 \\
n_2 &= n_0p_2 + n_1p_1 = 150 + 10 = 160 \\
n_3 &= n_0p_3 + n_1p_2 + n_2p_1 = 250 + 15 + 16 = 281 \\
n_4 &= n_0p_4 + n_1p_3 + n_2p_2 + n_3p_1 = 377 \\
n_5 &= n_0p_5 + n_1p_4 + n_2p_3 + n_3p_2 + n_4p_1 = 350 \\
n_6 &= n_1p_5 + n_2p_4 + n_3p_3 + n_4p_2 + n_5p_1 = 230 \\
n_7 &= n_2p_5 + n_3p_4 + n_4p_3 + n_5p_2 + n_6p_1 = 286
\end{align*}


It will be noticed that the number of bulbs failing each week increases 
until week 4 and then decreases. It can be shown that the number will later 
start to increase, and it will continue to oscillate until ultimately the 
system settles down to a steady state in which the proportion of bulbs 
failing each week is the reciprocal of their average life. The average life 
is 1 × 0.10 + 2 × 0.15 + 3 × 0.25 + 4 × 0.30 + 5 × 0.20 = 3.35. In 
the steady state, the number of failures each week will be 1000/3.35 = 
299. Thus the present policy of replacing bulbs only on failure costs 
$299 a week. 

If all bulbs were replaced at the end of the first week, the cost would 
be $250 plus the cost of replacing failures during the week; i.e., 250 + 
100 = 350. If all bulbs were replaced every two weeks, the cost during 
two weeks would be 250 + 100 + 160 = 510 dollars. This is an average 
of $255 per week. Replacement at the end of three weeks would 
cost 250 + 100 + 160 + 281 = 791 dollars every three weeks, or an 
average of $264 per week. Thus it would be cheapest to replace all the 
bulbs every two weeks. 
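
The recursion for the weekly failures and the cost comparison can be sketched in Python. The sketch follows assumptions (a) and (b) of the solution and carries unrounded expected values, so its figures agree with those above only after rounding; the function names are our own.

```python
FAIL_PROB = [0.10, 0.15, 0.25, 0.30, 0.20]   # p_1 .. p_5 from the example
BULBS = 1000
INDIVIDUAL_COST = 1.00        # dollars per bulb replaced on failure
GROUP_COST_PER_BULB = 0.25    # dollars per bulb in a group replacement


def weekly_failures(n0, p, weeks):
    """Expected replacements n_1 .. n_weeks when all n0 bulbs are new at time zero."""
    n = [n0]
    for t in range(1, weeks + 1):
        # Bulbs installed at the end of week t - age fail in week t with probability p_age.
        n.append(sum(n[t - age] * p[age - 1]
                     for age in range(1, min(t, len(p)) + 1)))
    return n[1:]


failures = weekly_failures(BULBS, FAIL_PROB, 7)


def average_weekly_cost(interval):
    """Average cost per week if all bulbs are replaced every `interval` weeks."""
    group = BULBS * GROUP_COST_PER_BULB
    individual = INDIVIDUAL_COST * sum(failures[:interval])
    return (group + individual) / interval


costs = {k: average_weekly_cost(k) for k in range(1, 6)}
best_interval = min(costs, key=costs.get)

print("expected failures by week:", [round(x) for x in failures])
for k, c in costs.items():
    print(f"group replacement every {k} week(s): {c:.0f} dollars per week")
print("cheapest interval:", best_interval, "weeks")
```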


Example 5 

A large population is subject to a given mortality curve for a very 
long period of time. All deaths are immediately replaced by births, and 
there are no other entries or exits. Show that the age distribution 
ultimately becomes stable and that the number of deaths per unit time 
becomes constant. 


SOLUTION 

The reader who associates the wording of this problem with human 
populations will probably realize that no group of people ever existed 
under these conditions. The problem is stated in this way because the 
analysis was first done with hypothetical human populations in mind. 
However, it is only necessary to translate the problem into industry, 
where death is equivalent to a part failure, and birth to replacement, 
and we have a fairly common situation. 

In order to simplify the problem, we will assume that each death 
occurs just before some time t = k, where k is an integer, and that no 
individual survives longer than ω + 1 time units. 

Let f(t) be the number of births (replacements) at time t, and let 
p(x) be the probability of a member dying just before age x+1. At 
time t, the survivors of the f(t−x) births at time t−x are aged x, so that 
their a priori probability of dying just before time t+1 is p(x), and the 
number of deaths of such survivors just before time t+1 is p(x) f(t−x). 

Hence, the total number of deaths just before time t+1 will be 


\[ \sum_{x=0}^{\omega} f(t - x)\,p(x), \qquad t = \omega,\ \omega+1,\ \omega+2,\ \ldots \]

Since deaths are replaced by births, this quantity equals the births 
at time t+1: 


\[ f(t+1) = \sum_{x=0}^{\omega} f(t - x)\,p(x) \tag{5.6} \]


This is a difference equation in t, and may be solved by substituting 
\(f(t) = A\alpha^{t}\): 


\[ A\alpha^{t+1} = A\sum_{x=0}^{\omega} p(x)\,\alpha^{t-x} \]


We divide by \(A\alpha^{t-\omega}\) and obtain 

\[ \alpha^{\omega+1} - [\alpha^{\omega}p(0) + \alpha^{\omega-1}p(1) + \cdots + \alpha\,p(\omega-1) + p(\omega)] = 0 \tag{5.7} \]


This is a polynomial equation in α. We note that, since \(\sum_{x=0}^{\omega} p(x) = 1\), 
one solution is α = 1. It can be shown that the modulus of all the 
remaining roots is less than unity. Let us denote the remaining roots 
by \(\alpha_1, \alpha_2, \ldots, \alpha_\omega\). A little reflection will show that the general solution 
to (5.6) has the form 


\[ f(t) = A_0 + A_1\alpha_1^{t} + A_2\alpha_2^{t} + \cdots + A_\omega\alpha_\omega^{t} \]


where \(A_0, A_1, \ldots, A_\omega\) are constants which could be determined from a 
knowledge of the age distribution at some given point in time. Since 
each \(|\alpha_i| < 1\), we see that, for large t, f(t) approaches \(A_0\). That is, the number 
of deaths per unit time (as well as the number of births) is constant at 
\(A_0\). The probability of surviving longer than x time units is 1 − 
p(0) − p(1) − ··· − p(x−1). We will use P(x) to denote this probability, 
and interpret P(0) as unity. The number of survivors aged x 
at any fixed point in time is the number of births at an instant x time 
units earlier, multiplied by P(x). Hence, once births and deaths have 
settled down to a constant rate \(A_0\), the number of survivors aged x is 
stable at \(A_0 P(x)\). 

Since the number of births always equals the number of deaths, the 
size N of the total population remains constant, and we have 


\[ N = A_0 \sum_{x=0}^{\omega} P(x) \]

or 


\[ A_0 = \frac{N}{\sum_{x=0}^{\omega} P(x)} \tag{5.8} \]


Once \(A_0\) has been computed by means of (5.8), the limiting age distribution 
may be computed by the rule given at the end of the previous 
paragraph; namely, the number of survivors aged zero is \(A_0\), the number 
of survivors aged 1 is \(A_0 P(1)\), the number of survivors aged 2 is 
\(A_0 P(2)\), and so on. 

Finally, we show that the denominator in (5.8) may be interpreted 
as the mean age at death. We first write 


\[ \sum_{x=0}^{\omega} P(x) = \sum_{x=0}^{\omega} P(x)\,\Delta(x) \]


which is permissible since the first difference of x is unity. Then upon 
applying equation (I-17) of Appendix I, we obtain 


\[ \sum_{x=0}^{\omega} P(x) = (\omega + 1)\,P(\omega + 1) - (0)\,P(0) - \sum_{x=0}^{\omega} (x + 1)\,\Delta P(x) \]


But \(P(\omega + 1) = 0\) and \(\Delta P(x) = -p(x)\). Hence 


\begin{align*}
\sum_{x=0}^{\omega} P(x) &= \sum_{x=0}^{\omega} (x + 1)\,p(x) \\
 &= \sum_{y=1}^{\omega+1} y\,P\{\text{age at death is } y\} \\
 &= \text{mean age at death}
\end{align*}

Therefore, the ultimate rate of deaths can be obtained by dividing the 
size of the population by the mean age at death. 
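
The result can be illustrated numerically. The sketch below is our own check, not part of the original solution: it borrows the bulb mortality of Example 4 as the distribution p(x) (so that ω = 4), starts from an assumed population of 1000 items all new at time zero, iterates (5.6), and compares the outcome with (5.8).

```python
DEATH_PROB = [0.10, 0.15, 0.25, 0.30, 0.20]   # p(0) .. p(4), taken from Example 4
N = 1000                                      # population size (assumed all new at t = 0)


def simulate_births(p, n, steps):
    """Iterate f(t+1) = sum over x of f(t-x) p(x), starting from n births at t = 0."""
    f = [n]
    for t in range(steps):
        f.append(sum(f[t - x] * p[x] for x in range(len(p)) if t - x >= 0))
    return f


# P(x): probability of surviving longer than x time units, with P(0) = 1.
survival = [1 - sum(DEATH_PROB[:x]) for x in range(len(DEATH_PROB))]
mean_age_at_death = sum((x + 1) * px for x, px in enumerate(DEATH_PROB))

a0 = N / sum(survival)                        # equation (5.8)
stable_ages = [a0 * s for s in survival]      # limiting number of survivors aged x

births = simulate_births(DEATH_PROB, N, 500)

print("sum of P(x):", round(sum(survival), 4), " mean age at death:", mean_age_at_death)
print("A0 from (5.8):", round(a0, 3), " births after 500 steps:", round(births[-1], 3))
print("limiting age distribution:", [round(s, 1) for s in stable_ages])
```

The simulated births settle at the value given by (5.8), and the limiting age distribution sums to the population size.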


STAFFING PROBLEMS 


Problems connected with recruitment and promotion of staff some- 
times lend themselves to analysis similar to that used in industrial 
replacement. We give two examples which will illustrate the principles. 


Example 6 


An airline requires 200 assistant hostesses, 300 hostesses, and 50 
supervisors. Girls are recruited at age 21 and, if still in service, retire 
at age 60. Given the “life” table (Table 3), determine 


1. How many girls should be recruited each year. 
2. At what ages should promotions take place. 


(Assume that all withdrawals and promotions take place on the anniversary 
of entry.) 


TABLE 3. “LIFE TABLE” FOR AIRLINE HOSTESSES 

 x    \(\lambda_x\)   \(d_x\)   \(p_x\)    \(q_x\)         x    \(\lambda_x\)   \(d_x\)   \(p_x\)    \(q_x\) 
21   1000   400   0.600   0.400       41    119     6   0.952   0.048 
22    600   120   0.800   0.200       42    113     7   0.947   0.053 
23    480    96   0.800   0.200       43    106     7   0.942   0.058 
24    384    77   0.800   0.200       44     99     6   0.936   0.064 
25    307    46   0.800   0.200       45     93     6   0.930   0.070 
26    261    33   0.850   0.150       46     87     7   0.923   0.077 
27    228    22   0.875   0.125       47     80     7   0.915   0.085 
28    206    16   0.900   0.100       48     73     7   0.906   0.094 
29    190     9   0.925   0.075       49     66     7   0.896   0.104 
30    181     8   0.950   0.050       50     59     6   0.885   0.115 
31    173     6   0.958   0.042       51     53     7   0.873   0.127 
32    167     6   0.962   0.038       52     46     7   0.860   0.140 
33    161     6   0.965   0.035       53     39     6   0.846   0.154 
34    155     5   0.967   0.033       54     33     6   0.831   0.169 
35    150     4   0.968   0.032       55     27     5   0.815   0.185 
36    146     5   0.968   0.032       56     22     4   0.798   0.202 
37    141     5   0.968   0.032       57     18     4   0.780   0.220 
38    136     5   0.965   0.035       58     14     3   0.761   0.239 
39    131     6   0.961   0.039       59     11     3   0.741   0.259 
40    125     6   0.957   0.043 

Notation 

\(\lambda_x\) = number of girls out of original 1000 still in service at age x 
\(d_x\) = withdrawals between ages x and x+1 
\(p_x\) = probability that a girl in service at age x will still be in service at age x+1 
\(q_x\) = probability that a girl in service at age x will withdraw before age x+1 

\[ \Bigl[\, d_x = \lambda_x - \lambda_{x+1}, \qquad p_x = \frac{\lambda_{x+1}}{\lambda_x}, \qquad q_x = \frac{d_x}{\lambda_x}, \qquad p_x + q_x = 1 \,\Bigr] \]


SOLUTION 



Let z, which is a constant to be determined, be the number of girls 
recruited each year. If 1000 girls had been recruited each year for the 
past 39 years, there would be \(\lambda_x\) now in service at each age x. Hence 
the total number in service will be 


\[ \sum_{x=21}^{59} \lambda_x = 6480 \]


By proportion, if z girls are recruited each year, the total number in 
the system will be 6480(z/1000). We require 550. Hence 

6480(z/1000) = 550 
z = 85 

Suppose that promotion from assistant hostess to hostess occurs at 
age \(x_1\). We always have in the system \((85/1000)\lambda_x\) girls of age x. The 
number of assistant hostesses in the system is thus \(0.085\sum\lambda_x\), where the 
summation goes from x = 21 to x = \(x_1 - 1\). We require 200 assistant 
hostesses. Hence the condition on \(x_1\) is 


\[ 0.085 \sum_{x=21}^{x_1 - 1} \lambda_x = 200 \]

or 

\[ \sum_{x=21}^{x_1 - 1} \lambda_x = 2353 \]


From the \(\lambda_x\) column of the life table, we find that \(x_1 = 25\), to the nearest 
integer. 

Similarly, if promotion from hostess to supervisor takes place at age 
\(x_2\) we obtain the condition 

\[ \sum_{x=x_1}^{x_2 - 1} \lambda_x = \frac{300}{0.085} = 3529 \]

whence \(x_2 = 48\). 

Thus 85 assistant hostesses of age 21 will be recruited each year, and 
promotions to hostess and supervisor will occur at ages 25 and 48, 
respectively. 
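
The arithmetic of this solution can be sketched in Python. The list below holds the column of numbers still in service at ages 21 to 59 as read from Table 3; the function names are our own, and "to the nearest integer" is interpreted here as choosing the promotion age whose group size comes closest to the requirement.

```python
# Numbers still in service at ages 21..59 out of 1000 entrants (Table 3).
IN_SERVICE = [1000, 600, 480, 384, 307, 261, 228, 206, 190, 181,
              173, 167, 161, 155, 150, 146, 141, 136, 131, 125,
              119, 113, 106, 99, 93, 87, 80, 73, 66, 59,
              53, 46, 39, 33, 27, 22, 18, 14, 11]
FIRST_AGE = 21


def recruits_needed(total_staff):
    """Annual recruits so that the stable total staff equals total_staff."""
    return round(total_staff * 1000 / sum(IN_SERVICE))


def promotion_age(start_age, group_size, recruits):
    """Age x such that the staff aged start_age .. x-1 come closest to group_size."""
    scale = recruits / 1000
    best_age, best_gap, count = None, None, 0.0
    for age in range(start_age, FIRST_AGE + len(IN_SERVICE)):
        count += scale * IN_SERVICE[age - FIRST_AGE]
        gap = abs(count - group_size)
        if best_gap is None or gap < best_gap:
            best_age, best_gap = age + 1, gap
    return best_age


z = recruits_needed(550)
x1 = promotion_age(21, 200, z)     # assistant hostess -> hostess
x2 = promotion_age(x1, 300, z)     # hostess -> supervisor

print("recruits per year:", z)
print("promotion to hostess at age", x1, "and to supervisor at age", x2)
```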

In staffing problems of this type, with fixed total staff and fixed 
size of staff groups, the proportion of staff in each group determines 
the age at promotion, and conversely. In practice, “career prospects” 
are often considerably improved by the possibility of expansion. 

Before leaving this problem, we will indicate briefly how a life table 
can be obtained. From past experience, we compute the observed 
percentages \(p'_x\) for each age x. More specifically, \(p'_x\) is the observed 
percentage of survivals at the end of a year of a group aged x at the 
beginning of the year. [In obtaining these \(p'_x\), it may be necessary to 
lump some ages together because of sparsity of data.] Starting with 
any convenient number (here, 1000) for \(\lambda'_x\) at the youngest age x, we 
compute successively 


\[ \lambda'_{x+1} = p'_x\,\lambda'_x, \qquad \lambda'_{x+2} = p'_{x+1}\,\lambda'_{x+1}, \quad \text{etc.} \]


These values \(\lambda'_x\) are then plotted against x, and a smooth curve is 
drawn passing as close as possible to the points. From the curve, values 
of \(\lambda_x\) are read off for each age x and the results entered in the table. 
The remainder of the table can then be computed from the \(\lambda_x\) column. 

Of course, a great deal more sophistication can be used. Allowances 
might have to be made for entrants during the year, and mathematical 
methods might be used in curve fitting, but in many problems simple 
methods of the sort described here often suffice. 
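
The successive multiplication described above can be sketched as follows. The function name and the default starting number are our own choices, and the four survival rates are merely sample input (they are the first entries of the survival-probability column of Table 3). The sketch produces only the unsmoothed values; the curve-fitting step is not attempted.

```python
def life_table(survival_rates, radix=1000):
    """Numbers still in service at successive ages, starting from `radix` entrants."""
    table = [float(radix)]
    for rate in survival_rates:
        # Those in service a year later = those in service now x observed survival rate.
        table.append(table[-1] * rate)
    return table


# Sample observed survival rates for ages 21, 22, 23, 24.
rates = [0.600, 0.800, 0.800, 0.800]
unsmoothed = life_table(rates)
print([round(v) for v in unsmoothed])
```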


Example 7 

An airline, whose staff are subject to the same survival rates as in 
Example 6, presently has a staff whose ages are distributed as in Table 
4. It is estimated that for the next two years staff requirements will 
increase by 10% per year. If girls are to be recruited at age 21, how 
many should be recruited for next year and at what age will promotions 
take place? How many should be recruited for the following year, and 
at what age will promotions take place? 


TABLE 4. PRESENT STAFF AGE DISTRIBUTION 

   Assistants          Hostesses           Supervisors 
  Age    Number       Age    Number       Age    Number 
   21      90          26      40          42       5 
   22      50          27      35          43       4 
   23      30          28      35          44       5 
   24      20          29      30          45       3 
   25      10          30      28          46       3 
  Total   200          31      26          47       3 
                       32      20          48       6 
                       33      18          49       2 
                       34      16          50       - 
                       35      12          51       - 
                       36      10          52       4 
                       37       8          53       3 
                       38       -          54       5 
                       39       8          55       - 
                       40       8          56       3 
                       41       6          57       2 
                      Total   300          58       - 
                                           59       2 
                                          Total    50 


SOLUTION 

We multiply the number of staff at each age by the probability \(p_x\) 
that they will remain in service for one year, thus obtaining the age 
distribution one year later. 

For example, the number of supervisors in one year’s time will be: 


5 × 0.947 = 4.735     Aged 43 
4 × 0.942 = 3.768          44 
5 × 0.936 = 4.68           45 
3 × 0.930 = 2.79           46 
3 × 0.923 = 2.769          47 
3 × 0.915 = 2.745          48 
6 × 0.906 = 5.436          49 
2 × 0.896 = 1.792          50 
0 × 0.885 = 0              51 
0 × 0.873 = 0              52 
4 × 0.860 = 3.44           53 
3 × 0.846 = 2.538          54 
5 × 0.831 = 4.155          55 
0 × 0.815 = 0              56 
3 × 0.798 = 2.394          57 
2 × 0.780 = 1.56           58 
0 × 0.761 = 0              59 
2 × 0     = 0              60 * 
    Total  42.802 


* All those aged 60 retire. 


Thus, in a year’s time there will be 43 remaining supervisors. Since a 
total of 55 will be required, 12 girls have to be promoted from the 
hostess group. Assuming that promotion is based solely on age, the 
oldest 12 hostesses will be promoted. In a year’s time, there will be 


6 × 0.952 = 5.712 hostesses aged 42 
8 × 0.957 = 7.656 hostesses aged 41 


Thus, these two age groups together (13.368) will more than suffice 
to fill the vacancies for supervisors. One hostess aged 41 will not be 
promoted. (Computations are carried out to three decimal places to 
avoid errors in summation. Of course, policies must be expressed in 
integral numbers of girls.) 
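
The one-year projection of the supervisor group can be sketched in Python. The dictionaries simply restate the supervisor column of Table 4 and the survival probabilities used in the listing above; the names are our own.

```python
# Present supervisors by age (Table 4); ages with no staff are omitted.
SUPERVISORS = {42: 5, 43: 4, 44: 5, 45: 3, 46: 3, 47: 3, 48: 6, 49: 2,
               52: 4, 53: 3, 54: 5, 56: 3, 57: 2, 59: 2}
# One-year survival probabilities at each age (Table 3).
P_X = {42: 0.947, 43: 0.942, 44: 0.936, 45: 0.930, 46: 0.923, 47: 0.915,
       48: 0.906, 49: 0.896, 50: 0.885, 51: 0.873, 52: 0.860, 53: 0.846,
       54: 0.831, 55: 0.815, 56: 0.798, 57: 0.780, 58: 0.761}
RETIREMENT_AGE = 60


def project_one_year(staff, survival):
    """Expected numbers one year later, keyed by the age then reached."""
    projected = {}
    for age, count in staff.items():
        if age + 1 >= RETIREMENT_AGE:
            continue                      # those reaching age 60 retire
        projected[age + 1] = count * survival[age]
    return projected


next_year = project_one_year(SUPERVISORS, P_X)
remaining = sum(next_year.values())
required = 55                             # 50 supervisors plus 10 per cent
promotions = required - round(remaining)

print(f"supervisors remaining after one year: {remaining:.3f}")
print("hostesses to be promoted:", promotions)
```

The same function, applied in turn to the hostess and assistant groups, gives the remaining entries of the projection.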

In a similar fashion, we can compute the number of remaining host- 
esses and thus obtain the number of assistants to be promoted. Finally, 
we obtain the number of assistants remaining and deduce the number 
of recruits required by subtraction. 

The results of the calculations are shown in Table 5. From the table, 
we see that the number of 21-year-old recruits should be 143 next year, 
and 168 the year after that. Next year girls aged 24 and over will be 
promoted to hostess, and 17 girls aged 23 will be promoted. In the year 
after that, promotion will be at age 23, and in addition 12 girls aged 22 
will be promoted. Next year all but one of the girls aged 41 and over 
will be supervisors. In the year after that, all girls aged 40 and over 
and 3 girls aged 39 will be supervisors. 


TABLE 5. NUMBER OF GIRLS IN SERVICE AT THE START OF EACH YEAR 

      Present   Next       Following          Present   Next       Following 
 x    year      year       year          x    year      year       year 
21      90      143        168          41       6       7.656 §    7.357 
22      50       54         85.8 *      42       5       5.712      7.289 
23      30       40 †       43.2        43       4       4.735      5.409 
24      20       24         32          44       5       3.768      4.460 
25      10       16         19.2        45       3       4.68       3.527 
26      40        8         12.8        46       3       2.79       4.352 
27      35       34          6.8        47       3       2.769      2.575 
28      35       30.625     29.75       48       6       2.745      2.534 
29      30       31.5       27.563      49       2       5.436      2.487 
30      28       27.75      29.138      50       -       1.792      4.871 
31      26       26.6       26.363      51       -       -          1.586 
32      20       24.908     25.483      52       4       -          - 
33      18       19.24      23.961      53       3       3.44       - 
34      16       17.37      18.567      54       5       2.538      2.910 
35      12       15.472     16.797      55       -       4.155      2.109 
36      10       11.616     14.977      56       3       -          3.386 
37       8        9.68      11.244      57       2       2.394      - 
38       -        7.744      9.370      58       -       1.56       1.867 
39       8        7.72       7.473 ‡    59       2       -          1.187 
40       8        7.688      7.419 
                                        Total  550     605.363    666.390 

                   Present   Next   Following 
                   year      year   year 
Total required       550      605     666 
Supervisors           50       55      61 
Hostesses            300      330     363 
Assistants           200      220     242 

* 12 girls aged 22 will be promoted hostesses. 
† 17 girls aged 23 will be promoted hostesses. 
‡ 3 girls aged 39 will be promoted supervisors. 
§ 6 girls aged 41 will be promoted supervisors. 


MISCELLANEOUS REPLACEMENT 
PROBLEMS 


There are many replacement situations that cannot be classified as 
neatly as those in the examples so far. Apart from problems about 
capital equipment which deteriorates with age, the determination of 
replacement policies usually requires probability theory. We conclude 
this chapter with two general examples. 


Example 8 

A company manufactures automobile batteries at a factory cost of 
$10 each. Battery life is subject to mortality as in Table 6. The 
company has a guarantee policy under which, if a battery fails during 
the first month after purchase, a refund of the full price of a new battery 
is made; if it fails during the second month, a refund of 29/30 of the 
full price is given; during the third month, 28/30, and so on until the 
30th month after purchase, at which time a refund of 1/30 of the full 
price is made. At what unit price should batteries be sold so that on 
average the company will break even? (Assume battery age is zero at 
time of purchase.) 


TABLE 6. BATTERY LIFE MORTALITY 

            Probability of                      Probability of 
Age in      failure in          Age in          failure in 
months      next month          months          next month 
   0          0.05                16              0.00 
   1          0.00                17              0.01 
   2          0.00                18              0.01 
   3          0.00                19              0.01 
   4          0.00                20              0.01 
   5          0.00                21              0.01 
   6          0.00                22              0.01 
   7          0.00                23              0.015 
   8          0.00                24              0.015 
   9          0.00                25              0.020 
  10          0.00                26              0.025 
  11          0.00                27              0.030 
  12          0.00                28              0.035 
  13          0.00                29              0.040 
  14          0.00                30 and over     0.71 
  15          0.00 


SOLUTION 

Let K be the break-even price, and let \(p_i\) be the probability that a 
new battery will fail during the (i + 1)st month after purchase. 

The expected refund per battery sold will be 


\[ \sum_{i=0}^{30} \frac{30 - i}{30}\,K p_i = 0.0908K \]


The break-even price K is the sum of the factory cost and the expected 
refund. Thus 

K = 10 + 0.0908K 


yielding K = $11.00 
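
The refund calculation can be reproduced with a few lines of Python. The dictionary lists only the non-zero failure probabilities of Table 6 that fall within the guarantee period; the variable names are our own.

```python
FACTORY_COST = 10.0
GUARANTEE_MONTHS = 30

# Probability that a new battery fails during month i + 1 (Table 6).
# Ages not listed have zero failure probability within the guarantee period.
FAIL_PROB = {0: 0.05, 17: 0.01, 18: 0.01, 19: 0.01, 20: 0.01, 21: 0.01,
             22: 0.01, 23: 0.015, 24: 0.015, 25: 0.020, 26: 0.025,
             27: 0.030, 28: 0.035, 29: 0.040}

# Expected refund as a fraction of the selling price K.
refund_fraction = sum((GUARANTEE_MONTHS - i) / GUARANTEE_MONTHS * p
                      for i, p in FAIL_PROB.items())

# Break-even: K = factory cost + refund_fraction * K.
break_even_price = FACTORY_COST / (1 - refund_fraction)

print(f"expected refund = {refund_fraction:.4f} K")
print(f"break-even price K = {break_even_price:.2f}")
```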


Example 9 
A certain piece of equipment is extremely difficult to adjust. During 
a period when no adjustment is made, the running cost increases 
linearly with time, at a rate of b dollars per hour each hour. The 
running cost immediately after an adjustment is not known precisely 
until the adjustment has been made. Before the adjustment, the resulting 
running cost is deemed to be a random variable x with density 
function f(x), as shown in Figure 5.2. If each adjustment costs k 
dollars, when should adjustments be made? 


Figure 5.2. Density function f(x) for running cost after an adjustment (range of x: 
0 to X). 


SoLution : . ‘ 
We assume a policy of making an adjustment every time the running 


cost reaches a certain rate z, and wish to determine that value of z for 
which the expectation of the average cost per hour over some long 
period of time is a minimum. 
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The optimal z will first be derived on the assumption that z > X, 
where X (see Figure 5.2) is the highest running cost that can result 
immediately after an adjustment. 

Suppose that an adjustment has just been made at time t = 0,
resulting in running costs at a rate x. The next adjustment will occur
when running costs have climbed back to z, at time t = (z - x)/b.
The total cost for this cycle from one adjustment to another will be the
cost of an adjustment k, plus the integral from t = 0 to t = (z - x)/b
of the running cost x + bt. We obtain

$$\text{Cost for this cycle} = k + \frac{z^2 - x^2}{2b} \tag{5.9}$$

The cost per hour for the cycle is the cycle cost divided by the cycle
time (z - x)/b. We obtain

$$\text{Cost per hour for the cycle} = \frac{kb}{z - x} + \frac{z + x}{2}$$

The expected cost per hour for the cycle, which is seen to be the
expected cost per hour for the time that this policy is in operation, will
be our measure of effectiveness for the decision variable z. Denoting
this expected cost per hour by h(z), we have

$$h(z) = \int_0^X \left[\frac{kb}{z - x} + \frac{z + x}{2}\right] f(x)\,dx \tag{5.10}$$

To minimize h(z), we differentiate the right-hand side with respect to z,
in the manner explained in Appendix II. The result is

$$\frac{dh}{dz} = \int_0^X \left[\frac{1}{2} - \frac{kb}{(z - x)^2}\right] f(x)\,dx = 0 \quad \text{for a minimum}$$

This condition on z can be rewritten as

$$\int_0^X \frac{f(x)}{(z - x)^2}\,dx = \frac{1}{2kb} \tag{5.11}$$

The solution z = z_0 of this equation will be the best value for the
decision variable z, at least in the range z > X.
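
As an illustration of condition (5.11), namely that the integral of f(x)/(z - x)^2 over (0, X) equals 1/(2kb), the sketch below solves for z_0 numerically. It assumes, purely for illustration, that f(x) is uniform on (0, X); in that case the integral equals 1/[z(z - X)], so z_0 satisfies z(z - X) = 2kb. The function names and the values of X, k, and b are hypothetical and do not come from the example.

```python
import math

def lhs_511(z, f, X, steps=2000):
    """Midpoint-rule estimate of the integral of f(x)/(z - x)^2 over (0, X); needs z > X."""
    h = X / steps
    return sum(f((i + 0.5) * h) / (z - (i + 0.5) * h) ** 2 for i in range(steps)) * h

def solve_z0(f, X, k, b, z_hi=1.0e4, tol=1.0e-9):
    """Bisection on z > X; the left side of (5.11) decreases as z increases."""
    target = 1.0 / (2.0 * k * b)
    lo, hi = X * (1.0 + 1.0e-6), z_hi
    while hi - lo > tol:
        mid = 0.5 * (lo + hi)
        if lhs_511(mid, f, X) > target:
            lo = mid   # integral still too large: z_0 lies above mid
        else:
            hi = mid
    return 0.5 * (lo + hi)

# Hypothetical data: uniform density on (0, 4), k = $10 per adjustment, b = $2 per hour per hour.
X, k, b = 4.0, 10.0, 2.0
z0 = solve_z0(lambda x: 1.0 / X, X, k, b)
closed_form = (X + math.sqrt(X * X + 8.0 * k * b)) / 2.0  # positive root of z(z - X) = 2kb
print(z0, closed_form)
```

For any other density on (0, X), only the function passed as f needs to change; the closed form applies to the uniform case alone.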

We now show that the minimum cannot occur for z < X. If z < X,
repeated adjustments may be necessary at time t = 0 before the cycle
can begin. Consider the running cost x which results from the adjustment
made just before t = 0, i.e. at the end of the previous cycle. The
probability that no further adjustment is necessary at t = 0 is the same
as the probability of the event {x < z}. If we use the notation

$$F(z) = \int_z^X f(x)\,dx$$

we have

$$P\{\text{no further adjustment at } t = 0\} = 1 - F(z)$$

Similarly, we observe that

$$P\{r \text{ further adjustments at } t = 0\} = [F(z)]^r\,[1 - F(z)]$$

Each such adjustment will cost k dollars. The expected cost of these
further adjustments at t = 0 is

$$\sum_{r=0}^{\infty} r k\,[F(z)]^r\,[1 - F(z)] = \frac{kF(z)}{1 - F(z)}$$


Once all needed adjustments have been made, so that costs are running
at a rate x, less than z, the remainder of the cycle cost will be as in (5.9).
Adding (5.9) to the expected adjustment cost gives

$$\text{Cost for cycle} = \frac{k}{1 - F(z)} + \frac{z^2 - x^2}{2b}$$

The cycle time is still (z - x)/b; so we have

$$\text{Cost per hour for the cycle} = \frac{kb}{(z - x)[1 - F(z)]} + \frac{z + x}{2}$$

In calculating the expected cost per hour, the density used for x is no
longer f(x), but the conditional density f(x | x < z) = [1 - F(z)]^{-1} f(x).
Denoting as before the expected cost per hour by h(z), we have

$$h(z) = \frac{kb}{[1 - F(z)]^2} \int_0^z \frac{f(x)}{z - x}\,dx + \frac{1}{2[1 - F(z)]} \int_0^z (z + x) f(x)\,dx \tag{5.12}$$


Inasmuch as z is definitely less than X in this calculation, f(x) will
exceed some fixed positive number in some neighborhood of z, at least
for density functions of the type shown in Figure 5.2. The first integral
in (5.12) is therefore of the order of $\int_0^z \frac{dx}{z - x}$, which is itself infinite.

The interpretation is that the expected hourly cost h(z) is infinite for
any z chosen less than X. Since h(z_0) as given by (5.11) and (5.10) is
obviously finite, we conclude that the value z_0 previously obtained is
the optimal value for the decision variable z.


PROBLEMS FOR SOLUTION
Set VII


1. (a) Machine A costs $9000. Annual operating costs are $200 for the first
year, and they increase by $2000 every year (e.g. in the fourth year operating 
costs are $6200). Determine the best age at which to replace the machine. If 
the optimum replacement policy is followed, what will be the average yearly 
cost of owning and operating the machine? (Assume that the machine has no 
resale value when replaced, and that future costs are not discounted.) 

(b) Machine B costs $10,000. Annual operating costs are $400 for the first
year, and they increase by $800 every year. You now have a machine of type A 
which is one year old. Should you replace it with B, and if so, when? 

(c) Suppose you are just ready to replace machine A with another machine 
of the same type, when you hear that machine B will become available in a 
year. What should you do? 


2. Machine C costs $10,000. Operating costs are $500 per year for the first
five years. In the sixth and succeeding years operating costs increase $100
each year. Assuming a 10% cost of money per year, find the optimum length
of time to hold the machine before replacing it.


TABLE 7. Truck Tire Mortality

Age of tire at failure, miles    Percentage of tires

0                                0
0-1,000                          0
1,001-2,000                      0
. . .
9,001-10,000                     0
10,001-11,000                    0.010
11,001-12,000                    0.010
12,001-13,000                    0.015
13,001-14,000                    0.020
14,001-15,000                    0.027
15,001-16,000                    0.036
16,001-17,000                    0.040
17,001-18,000                    0.060
18,001-19,000                    0.090
19,001-20,000                    0.130
20,001-21,000                    0.180
21,001-22,000                    0.165
22,001-23,000                    0.125
23,001-24,000                    0.080
24,001-25,000                    0.012


3. Truck tires which fail in service can cause expensive accidents. It is
estimated that a failure in service results in an average cost of $250, exclusive
of the cost of replacing the blown tire. New tires cost $100 each, and are
subject to mortality as in Table 7. If the measure of effectiveness for a
replacement policy is the average cost per mile, and if tires are to be replaced
after a certain fixed mileage or on failure (whichever occurs first), determine
the optimum replacement policy.

[Hint: Assume that failures take place at the exact ages 10,500; 11,500;
12,500; etc. Compute average useful life per tire for various replacement ages,
and the expected cost of failures for various replacement ages.]

[Ans. Replace at 15,000 miles; average cost 0.812 cent per mile] 
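
The stated answer can be checked with a short computation that follows the hint: failures are placed at the mid-interval ages 10,500; 11,500; and so on, and each candidate replacement age is scored by expected cost per tire divided by expected useful life per tire. This is only a sketch of one way to organize the arithmetic; the function and variable names are illustrative, and the candidate ages are assumed to be whole thousands of miles.

```python
# Table 7, restated per the hint: (assumed failure age in miles, proportion of tires).
MORTALITY = [(10500, 0.010), (11500, 0.010), (12500, 0.015), (13500, 0.020),
             (14500, 0.027), (15500, 0.036), (16500, 0.040), (17500, 0.060),
             (18500, 0.090), (19500, 0.130), (20500, 0.180), (21500, 0.165),
             (22500, 0.125), (23500, 0.080), (24500, 0.012)]
TIRE_COST = 100.0      # dollars per new tire
FAILURE_COST = 250.0   # dollars per failure in service, excluding the tire

def cost_per_mile(replace_at):
    """Expected dollars per mile when tires are replaced at `replace_at` miles or on failure."""
    failed = [(age, p) for age, p in MORTALITY if age < replace_at]
    p_fail = sum(p for _, p in failed)
    mean_life = sum(age * p for age, p in failed) + replace_at * (1.0 - p_fail)
    return (TIRE_COST + FAILURE_COST * p_fail) / mean_life

def best_replacement_age(candidates=range(10000, 26000, 1000)):
    """Candidate age (miles) with the lowest expected cost per mile."""
    return min(candidates, key=cost_per_mile)

best = best_replacement_age()
print(best, round(100.0 * cost_per_mile(best), 3), "cents per mile")
```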


4. The following failure rates have been observed for a certain type of light
bulb.

End of week    Probability of failure to date
               0.05

The cost of replacing an individual failed bulb is $1.25. The decision is made
to replace all bulbs simultaneously at fixed intervals, and also to replace 
individual bulbs as they fail in service. If the cost of group replacement is 
30 cents per bulb, what is the best interval between group replacements? At 
what group replacement price per bulb would a policy of strictly individual 
replacement become preferable to the adopted policy? 


5. A computing machine has a large number of electronic tubes, each of 
which has a life normally distributed about a mean of 1200 hours with a stand- 
ard deviation of 150 hours. If all tubes were to be replaced at fixed intervals 
on the night shift (when the computer is not normally used), the only costs 
would be the costs of labor and parts, which average about $3.00 a tube. 
Replacement of individual tubes which fail in service costs about $8.00 for 
labor and parts, plus the cost of computer down time which runs about $80.00 
for an average tube failure. How frequently should all tubes be replaced? 
[Hint: The average working life of a tube is 1200/16, or 75 days. Consider the 
costs of policies involving group replacement at the end of 50 days, at the end of 
55 days, etc.]

6. At time zero, all items in a certain system are new. Each item has a
probability p of failing immediately before the end of the first month of life,
and a probability q = 1 - p of failing immediately before the end of the
second month (i.e. all items fail by the end of the second month). If all items
are replaced as they fail, show that the expected number of failures f(x) at the
end of month x is given by

$$f(x) = \frac{N}{1 + q}\left[1 - (-q)^{x+1}\right]$$

where N is the number of items in the system.
If the cost per item of individual replacements is C_1, and the cost per item
of group replacement is C_2, find the conditions under which

(a) A group replacement policy at the end of each month is the most
profitable.

(b) A group replacement policy at the end of every other month is the most
profitable.

(c) No group replacement policy is better than a policy of pure individual
replacement.
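
Before attempting the proof in Problem 6, the closed form can be checked numerically against a direct month-by-month count. The sketch below assumes the renewal relation f(x) = p f(x-1) + q f(x-2), with f(0) = N standing for the N items installed at time zero; that relation is our reading of the problem statement, and the names and the sample values of p and N are illustrative.

```python
def expected_failures(p, N, months):
    """f(0..months) from f(x) = p*f(x-1) + q*f(x-2); f(0) = N is the initial installation."""
    q = 1.0 - p
    f = [float(N), p * N]
    for x in range(2, months + 1):
        f.append(p * f[x - 1] + q * f[x - 2])
    return f[: months + 1]

def closed_form(p, N, x):
    """The formula to be proved: N/(1+q) * [1 - (-q)^(x+1)]."""
    q = 1.0 - p
    return N / (1.0 + q) * (1.0 - (-q) ** (x + 1))

# Hypothetical values: p = 0.3, N = 1000 items.
for x, fx in enumerate(expected_failures(0.3, 1000, 6)):
    print(x, round(fx, 3), round(closed_form(0.3, 1000, x), 3))
```

Agreement of the two columns is evidence for the formula, not a proof of it.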


7. A piece of equipment either can fail completely, so that it has to be
scrapped (no salvage value), or may suffer a minor defect which can be repaired.
The probability that it will not have to be scrapped before age t is f(t). The
conditional probability that it will need a repair in the interval t to t + dt,
given that it is in running order at age t, is r(t) dt. The probability of a repair
or complete failure is dependent only on the age of the equipment, and not on
the previous repair history.

Each repair costs C, and complete replacement costs K. For some considerable
time the policy has been to replace only on failure.

(a) Derive a formula for the expected cost per unit time of the present
policy of replacing only on failure.

(b) It has been suggested that it might be cheaper to scrap equipment at
some fixed age T, thus avoiding the higher risk of repairs with advancing age.
Show that the expected cost per unit time of such a policy is

$$\frac{C \int_0^T f(u)\,r(u)\,du + K}{\int_0^T f(u)\,du}$$


(c) By differentiating this expression, find a condition to be satisfied by T
for minimum cost.


CHAPTER SIX


Waiting lines


Decision situations frequently arise in which units arriving for 
service must wait before they can be serviced. If the laws governing 
arrivals, servicing times, and the order in which arriving units are 
taken into service are known, then the nature of this waiting situation 
can be studied and analyzed mathematically. In this chapter we pre- 
sent some of the more elementary theory of waiting lines and some 
simple applications. 

Arriving units may form one line and be serviced through only one 
station, as in a doctor’s office; they may form one line and be serviced 
through several stations, as in a barber shop; or they may form several 
lines and be served through as many stations, as at the checkout 
counters of a supermarket. In this chapter we assume only one kind 
of queue discipline: first-in, first-out; with a unit going into service at 
the moment a station becomes empty. 

A general assumption running through the chapter is that the 
waiting-line process will eventually become “stable,” in the sense that 
the probability that n units are waiting at any instant remains the 
same as time passes. We shall not treat the transient case (a waiting 
line whose corresponding probabilities have not yet settled down to 
their stable values) or the explosive case (a waiting line that increases 
indefinitely with time). 

The various properties of a waiting line, such as the number in line
at any instant or the waiting time experienced by a particular arrival,
are random variables in the sense of Chapter Two. The reason that
these variables are random, rather than functionally dependent on time,
is that arrivals are, in general, random events in time, and service times
are random variables as well. Thus we shall be concerned with
estimating only the average length of the line at any instant, the average
idle time of the service facility in a 24-hour period, and so on.

When confronted with a problem involving waiting lines and the 
costs arising from them, the decision maker would use his knowledge 
of the average characteristics of the queue in attempting to reduce 
costs, maximize output, or whatever. Some of the changes he might 
recommend are: changing the number of servicing stations, changing 
the average service time in one or more stations, splitting a single 
queue or amalgamating several queues, and so on. Such changes 
would be evaluated by first considering their effect on the average 
characteristics of the waiting line, and then translating these changes 
in average characteristics into changes in the chosen measure of 
effectiveness. 


POISSON ARRIVALS AND EXPONENTIAL 
SERVICE TIMES 


In this section, we discuss a class of waiting-line situations that 
lend themselves to a relatively simple analytic treatment. Although 
the assumptions are fairly restrictive, the results apply to some situa- 
tions met with in practice. In any case, the mode of approach de- 
scribed here should prove useful as an indication of an attack on more 
complicated queueing problems. 


Arrivals and Services Considered Separately 


In general, arrivals do not occur at regular intervals in time, but
tend to be clustered or scattered in some fashion. The “Poisson
assumption” specifies the behavior of arrivals, by postulating the
existence of a constant λ, which is independent of time, queue length, or
any other random property of the queue, such that

$$P\{\text{an arrival occurs between time } t \text{ and time } t + \Delta t\} = \lambda\,\Delta t \tag{6.1}$$


if the interval Δt is sufficiently small. A waiting line for which arrivals
occur in accordance with (6.1) is called a queue with Poisson
arrivals. This terminology may seem surprising, but is justified by the
following result.* Let n be a discrete random variable representing
the number of arrivals in some time interval of fixed length T. Then,
if equation (6.1) is satisfied, n obeys a Poisson distribution with
parameter λT:

$$f_T(n) = P\{n \text{ arrivals in time } T\} = \frac{(\lambda T)^n e^{-\lambda T}}{n!} \tag{6.2}$$

* For a proof, see William Feller, Probability Theory and Its Applications, 2nd ed.,
p. 400 et seq., John Wiley & Sons, New York, 1957.


A further consequence is that if T is a random variable representing
the time between consecutive arrivals, then T obeys an exponential
distribution with parameter λ:

$$g(T) = \text{density function for } T = \lambda e^{-\lambda T} \tag{6.3}$$


Actually, any or all of the relations (6.1), (6.2), and (6.3) may be 
taken as characterizing Poisson-type arrivals, since it is possible to 
prove any one of the relations starting with any other. 

The mean arrival rate in a waiting-line situation is defined, as might
be anticipated, as the expected number of arrivals occurring in a time
interval of length unity. If arrivals are Poisson, we see from equation
(6.2) that the expected number of arrivals in a time interval of length
T is λT. Setting T = 1, it follows that the mean arrival rate for
Poisson arrivals is just λ. The mean arrival rate λ is a dimensional
number, whose units are arrivals per time unit.

The expected time between consecutive arrivals is just the mean of
the random variable T whose density function is given by (6.3). This
mean time between arrivals is easily computed to be 1/λ. (Note: the
reciprocal relation between the mean arrival rate and the mean time
between arrivals is one of the implications of the Poisson assumption
for arrivals, and may not hold true for other arrival distributions.)
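
The link between the Poisson count (6.2) and the exponential gap (6.3) can be seen numerically. The sketch below sums the Poisson probabilities directly to recover the mean λT, and compares the probability of no arrival in an interval of length T with the probability that the time to the next arrival exceeds T. The function names and sample values are illustrative, and the sum is truncated at an assumed upper limit n_max.

```python
import math

def poisson_pmf(n, lam, T):
    """Equation (6.2): probability of n arrivals in an interval of length T."""
    return (lam * T) ** n * math.exp(-lam * T) / math.factorial(n)

def mean_arrivals(lam, T, n_max=100):
    """Expected number of arrivals in T, summed from the pmf (truncated at n_max)."""
    return sum(n * poisson_pmf(n, lam, T) for n in range(n_max + 1))

def prob_gap_exceeds(lam, T):
    """From (6.3): integral of lam*exp(-lam*t) from T to infinity."""
    return math.exp(-lam * T)

lam, T = 0.1, 10.0   # hypothetical: 0.1 arrival per minute, 10-minute interval
print(mean_arrivals(lam, T))        # close to lam * T = 1.0
print(poisson_pmf(0, lam, T))       # P{no arrival in T}
print(prob_gap_exceeds(lam, T))     # P{next arrival later than T}: the same number
```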

Discussion of the properties of the servicing facility is complicated
somewhat by the fact that servicing can take place only when there
is a unit in the system requiring service. This explains the conditioning
event in the following assumption. The statement (6.4) is analogous
to (6.1) and is assumed to hold for each servicing station:

$$P\{\text{a serviced unit is turned out in the interval } t \text{ to } t + \Delta t, \text{ given that a unit is being serviced at time } t\} = \mu\,\Delta t \tag{6.4}$$

where μ is some constant. Here it is assumed that the constant of
proportionality μ is independent of time, of queue length, and of any
other random characteristic of the waiting-line system. It can be
shown that, if the output of serviced units obeys (6.4), then servicing
time is subject to the exponential distribution. More precisely, if s
is a random variable representing the time it takes the station to
complete service on a unit, then

$$g(s) = \text{density function of } s = \mu e^{-\mu s} \tag{6.5}$$


The mean servicing rate for a particular station is defined as the
conditional expectation of the number of services completed in one
time unit, given that servicing is going on throughout the entire time
unit. If servicing times are exponential, it turns out that the mean
servicing rate is μ, which thus has dimensions of services per time unit.
Roughly, μ may be regarded as the quotient of the output of the servicing
station over some long time interval, divided by the portion of the
time interval that the servicing station is actually in operation. The
mean servicing time has the obvious interpretation as the mean of the
random variable s mentioned above; for exponential service times, we
find from (6.5) that the mean service time is 1/μ.

To complete the analogy between Poisson arrivals and exponential
service times, we note that, if n is the number of potential services in
time T, i.e., the number of units the servicing station could turn out
in time T if there were no enforced idle time, then n obeys a Poisson
distribution with parameter μT:

$$P\{n \text{ services in time } T, \text{ given that servicing is going on throughout } T\} = \frac{(\mu T)^n e^{-\mu T}}{n!} \tag{6.6}$$

As with Poisson arrivals, the relations (6.4), (6.5), and (6.6) are all
equivalent.


The Basic Equations Governing the Queue 


In this subsection, we show how the assumptions concerning arrivals,
service times, queue discipline, and stability in time may be combined
to furnish information concerning the total behavior of the waiting-
line system. This information will take the form of a set of recursion
relations which lead directly to the probability distribution for the
random variable n, where n is the number of units in the system at
any specified instant after stability has been achieved.

The recursion relations stated below apply to the general case of
k servicing stations, each with mean service rate μ. These k stations
are fed from one queue, with mean arrival rate λ. Though the proof
for general k is not given here, the proof offered later for two servicing
stations will illustrate the main ideas.

Let

n = number in system (number in queue plus number being
    serviced)
P_n = probability that there are n units in the system at time t (or
    at any other time)
k = number of servicing stations

The P_n may be calculated successively in terms of P_0 by

$$
\begin{aligned}
P_1 &= \frac{\lambda}{\mu} P_0 \\
P_n &= \frac{\lambda + (n-1)\mu}{n\mu} P_{n-1} - \frac{\lambda}{n\mu} P_{n-2}, \qquad 2 \le n \le k \\
P_n &= \frac{\lambda + k\mu}{k\mu} P_{n-1} - \frac{\lambda}{k\mu} P_{n-2}, \qquad n \ge k+1
\end{aligned}
\tag{6.7}
$$

The evaluation of P_0 itself, once the P_n have all been expressed in
terms of P_0, is accomplished by use of the relation

$$\sum_{n=0}^{\infty} P_n = 1 \tag{6.8}$$
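
The procedure just described, expressing each P_n in terms of P_0 by the recursion and then fixing P_0 by the normalizing condition, can be sketched in a few lines. The code below assumes the recursion in the form P_1 = (λ/μ)P_0, with coefficients (λ + (n-1)μ)/(nμ) and λ/(nμ) for 2 ≤ n ≤ k, and (λ + kμ)/(kμ) and λ/(kμ) for n ≥ k + 1. The infinite sum is truncated at an assumed limit n_max, which is adequate only when λ is comfortably below kμ; the function name is illustrative.

```python
def state_probabilities(lam, mu, k, n_max=100):
    """P_0..P_n_max from the k-station recursion, normalized over the truncated range."""
    if lam >= k * mu:
        raise ValueError("stable probabilities exist only when lam < k * mu")
    ratios = [1.0, lam / mu]   # P_n / P_0 for n = 0 and n = 1
    for n in range(2, n_max + 1):
        if n <= k:
            a, d = lam + (n - 1) * mu, n * mu
        else:
            a, d = lam + k * mu, k * mu
        ratios.append((a * ratios[n - 1] - lam * ratios[n - 2]) / d)
    total = sum(ratios)        # truncated version of the normalizing sum
    return [r / total for r in ratios]

# Hypothetical rates: 3 arrivals and 2 services per time unit, two stations.
probs = state_probabilities(3.0, 2.0, 2)
print(probs[0], probs[1], probs[2])
```

With one station the same routine should reproduce P_0 = 1 - λ/μ, which gives a quick sanity check.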
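We derive the governing equations for the special case k = 2, for
which equations (6.7) specialize to

$$
\begin{aligned}
P_1 &= \frac{\lambda}{\mu} P_0 \\
P_2 &= \frac{\lambda + \mu}{2\mu} P_1 - \frac{\lambda}{2\mu} P_0 \\
P_n &= \frac{\lambda + 2\mu}{2\mu} P_{n-1} - \frac{\lambda}{2\mu} P_{n-2}, \qquad n \ge 3
\end{aligned}
\tag{6.9}
$$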

A diagram of the waiting-line system is presented as Figure 6.1.
At the instant pictured, there are 3 units in the queue and 5 units in
the system. We assume, as usual, that queue discipline is such that
an arrival moves immediately into a servicing station if there is a
station vacant.

Figure 6.1. A queue with two servicing stations. (Diagram labels: Arrivals, Queue, Servicing stations, Departures.)

In this situation we note that the probability of an arrival in a small
time interval of length Δt is λ Δt. The probability of a serviced unit
being turned out in the interval (t, t + Δt) is:

0        if there are no units in the system at t.
μ Δt     if there is one unit in the system at t.
2μ Δt    if there are two or more units in the system at t.

The probabilities of more than one arrival in the interval Δt, of more
than one service, or of an arrival and a service both occurring in this
interval are all taken to be zero, since they are proportional to the
second-order small quantity (Δt)^2.

The following two events I_0 and II_0 are equivalent, since event II_0
is simply a listing of all the ways event I_0 can occur:

Event I_0:  {0 units in system at time t + Δt}.
Event II_0: {0 units at time t, no arrivals in Δt}
            + {1 unit at time t, 1 service in Δt}.

Since the events are equivalent, the probability of occurrence of both
events must be the same. Thus

$$P\{\text{event I}_0\} = P_0 = P\{\text{event II}_0\} = P_0(1 - \lambda\,\Delta t) + P_1\mu\,\Delta t$$

Then

$$P_0 = P_0(1 - \lambda\,\Delta t) + P_1\mu\,\Delta t$$

which reduces to

$$P_1 = \frac{\lambda}{\mu} P_0$$

This proves the first equation of (6.9).
To obtain the second equation of (6.9), we apply the same procedure
to the events I_1 and II_1:

Event I_1:  {1 unit in system at time t + Δt}.
Event II_1: {1 unit at t, no arrivals or services in Δt}
            + {0 units at t, 1 arrival in Δt}
            + {2 units at t, 1 service in Δt}.

This leads to

$$P_1 = P_1\{1 - (\lambda + \mu)\,\Delta t\} + P_0\lambda\,\Delta t + P_2(2\mu)\,\Delta t$$

and hence

$$P_2 = \frac{\lambda + \mu}{2\mu} P_1 - \frac{\lambda}{2\mu} P_0$$

which is the second equation of (6.9).
For the events I_r and II_r with r ≥ 2, the over-all service rate is
seen to be 2μ for all the subevents of II_r:

Event I_r:  {r units in system at time t + Δt}.
Event II_r: {r units at t, no arrivals or services in Δt}
            + {r - 1 units at t, 1 arrival in Δt}
            + {r + 1 units at t, 1 service in Δt}.

Then

$$P_r = P_r\{1 - (\lambda + 2\mu)\,\Delta t\} + P_{r-1}\lambda\,\Delta t + P_{r+1}(2\mu)\,\Delta t$$

and

$$P_{r+1} = \frac{\lambda + 2\mu}{2\mu} P_r - \frac{\lambda}{2\mu} P_{r-1}, \qquad r \ge 2$$

The substitution n = r + 1 yields the final equation of (6.9).

Once the P, are determined, the various characteristics of the 
waiting-line system such as average queue length, idle time of the 
service facility, etc., are readily computed by the probability methods
of Chapter Two. The following subsections are devoted to examples 
illustrating the way the technique works out in practice. 


Queues: One Servicing Station 

For the case of one servicing station, Poisson arrivals with mean
arrival rate λ, and exponential service times with mean service rate μ,
the governing equations (6.7) specialize to

$$
\begin{aligned}
P_1 &= \frac{\lambda}{\mu} P_0 \\
P_n &= \frac{\lambda + \mu}{\mu} P_{n-1} - \frac{\lambda}{\mu} P_{n-2}, \qquad n \ge 2
\end{aligned}
\tag{6.10}
$$

An easy induction shows that

$$P_n = \left(\frac{\lambda}{\mu}\right)^n P_0, \qquad n \ge 0$$

and the use of the condition

$$\sum_{n=0}^{\infty} P_n = 1$$

then yields

$$P_n = \left(1 - \frac{\lambda}{\mu}\right)\left(\frac{\lambda}{\mu}\right)^n, \qquad n \ge 0 \tag{6.11}$$

Equation (6.11) is valid only when the mean service rate μ exceeds the
mean arrival rate λ.

For the applications, we need the distribution of such random 
variables as the waiting time of an arrival before being taken into 
service (we shall call this random variable w) and the total time of an 
arrival in the system (denoted by v; v = w plus service time). 

Inasmuch as there is a positive probability 1 - (λ/μ) = P_0 that an
arrival will not have to wait at all before going into service, the
distribution for w is part discrete (at w = 0) and part continuous (for
w > 0). If ψ(w) dw is the probability of the event {w < waiting time
< w + dw}, the resulting distribution for w is

$$
\begin{aligned}
P\{w = 0\} &= 1 - \frac{\lambda}{\mu} \\
\psi(w)\,dw &= \left(1 - \frac{\lambda}{\mu}\right)\lambda e^{(\lambda - \mu)w}\,dw, \qquad w > 0
\end{aligned}
\tag{6.12}
$$

Note that $\int_0^\infty \psi(w)\,dw$ has the value λ/μ and not unity, inasmuch as
the event {w > 0} is not a certain event.

The proof of (6.12) relies on the fact that the event {waiting time of
an arrival is between w and w + dw} is the composition, for n ranging
from 1 to ∞, of the events {n in system just before the arrival, n - 1
services in the time interval w, 1 service in time dw}. The reader can
fill in the details of the proof, using equations (6.4), (6.6), and (6.11).

If we are interested in the conditional density function for waiting
time, given that a person has to wait, we divide ψ(w) by P{w > 0},
and obtain*

$$\psi(w \mid w > 0) = (\mu - \lambda) e^{(\lambda - \mu)w} \tag{6.13}$$

(Here

$$\int_0^\infty \psi(w \mid w > 0)\,dw = 1$$

since P{w > 0 | w > 0} = 1.)

* The interpretation of the conditional density function ψ(w | w > 0) is as follows.
By definition, ψ(w | w > 0) dw is the conditional probability of the event “waiting
time is between w and w + dw,” given the event “waiting time is greater than zero.”
The conditional mean E(w | w > 0) is the mean of a random variable
w with density function ψ(w | w > 0). The definitions of conditional distribution
and conditional mean for a discrete random variable are analogous.

The density function θ(v) for the total time an arrival spends in the
system (waiting time plus service time) may be calculated in a similar
fashion, and turns out to be identical with (6.13), viz.:

$$\theta(v) = (\mu - \lambda) e^{(\lambda - \mu)v} \tag{6.14}$$


From (6.11), (6.12), (6.13), and (6.14) we may readily compute the
expectations of the various random variables. The results are listed
below.

(6.15)  $E(m) = \frac{\lambda^2}{\mu(\mu - \lambda)}$           average queue length
(6.16)  $E(m \mid m > 0) = \frac{\mu}{\mu - \lambda}$           average length of non-empty queues
(6.17)  $E(n) = \frac{\lambda}{\mu - \lambda}$                  average number of units in system
(6.18)  $E(w) = \frac{\lambda}{\mu(\mu - \lambda)}$             average waiting time of an arrival
(6.19)  $E(w \mid w > 0) = \frac{1}{\mu - \lambda}$             average waiting time of an arrival who waits
(6.20)  $E(v) = \frac{1}{\mu - \lambda}$                        average time an arrival spends in the system

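
For numerical work it is convenient to gather the six averages (6.15) through (6.20) in one routine. The sketch below takes them in the forms E(m) = λ²/[μ(μ - λ)], E(m | m > 0) = μ/(μ - λ), E(n) = λ/(μ - λ), E(w) = λ/[μ(μ - λ)], and E(w | w > 0) = E(v) = 1/(μ - λ). The function name, the dictionary keys, and the sample rates are illustrative choices.

```python
def single_station_averages(lam, mu):
    """Averages (6.15)-(6.20) for one station with Poisson arrivals and exponential service."""
    if mu <= lam:
        raise ValueError("the formulas require mu > lam")
    gap = mu - lam
    return {
        "E(m)": lam ** 2 / (mu * gap),   # average queue length
        "E(m|m>0)": mu / gap,            # average length of non-empty queues
        "E(n)": lam / gap,               # average number of units in system
        "E(w)": lam / (mu * gap),        # average waiting time of an arrival
        "E(w|w>0)": 1.0 / gap,           # average waiting time of an arrival who waits
        "E(v)": 1.0 / gap,               # average time an arrival spends in the system
    }

# Hypothetical rates: 3 arrivals and 4 services per hour.
for name, value in single_station_averages(3.0, 4.0).items():
    print(name, value)
```

Two relations provide a quick check on any set of values: E(v) = E(w) + 1/μ and E(n) = E(m) + λ/μ.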
Example 1 


Arrivals at a telephone booth are considered to be Poisson, with an
average time of 10 minutes between one arrival and the next. The
length of a phone call is assumed to be distributed exponentially, with
mean 3 minutes.

(a) What is the probability that a person arriving at the booth will
have to wait?

(b) What is the average length of the queues that form from time
to time?

(c) The telephone company will install a second booth when convinced
that an arrival would expect to have to wait at least three
minutes for the phone. By how much must the flow of arrivals be
increased in order to justify a second booth?


SOLUTION
Here λ = 0.1 arrival per minute
     μ = 0.33 service per minute

(a) $$P\{\text{an arrival has to wait}\} = 1 - P_0 = \frac{\lambda}{\mu} \quad \text{by (6.11)}$$
$$= 0.1/0.33 = 0.3$$

(b) $$E(m \mid m > 0) = \frac{\mu}{\mu - \lambda} \quad \text{by (6.16)}$$
$$= \frac{0.33}{0.23} = 1.43$$

(c) $$E(w) = \frac{\lambda}{\mu(\mu - \lambda)} \quad \text{by (6.18)}$$
$$= \frac{\lambda}{0.33(0.33 - \lambda)} \quad \text{if we fix } \mu \text{ at } 0.33$$

We seek the new value λ' for which E(w) = 3 minutes. Solving the
equation

$$\frac{\lambda'}{0.33(0.33 - \lambda')} = 3$$

we obtain an answer of λ' = 0.16 arrival per minute. So we must
increase the flow of arrivals from 6 per hour, the present figure, to 10
per hour.
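
Part (c) amounts to solving λ'/[μ(μ - λ')] = W for λ', which rearranges to λ' = Wμ²/(1 + Wμ). The sketch below carries out parts (a) through (c) using μ = 1/3 exactly rather than the rounded 0.33, so the last digits differ slightly from the hand computation; the function name is illustrative.

```python
def arrival_rate_for_wait(mu, target_wait):
    """Solve lam / (mu * (mu - lam)) = target_wait for lam."""
    return target_wait * mu ** 2 / (1.0 + target_wait * mu)

lam = 0.1        # arrivals per minute
mu = 1.0 / 3.0   # services per minute (mean call length 3 minutes)

p_wait = lam / mu                    # part (a), by (6.11)
nonempty_queue = mu / (mu - lam)     # part (b), by (6.16)
new_lam = arrival_rate_for_wait(mu, 3.0)   # part (c), by (6.18)

print(round(p_wait, 2), round(nonempty_queue, 2))
print(round(new_lam, 3), "per minute, i.e.", round(60.0 * new_lam), "per hour")
```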


Example 2 
As in Example 1, we have a telephone booth with Poisson arrivals
spaced 10 minutes apart on the average, and exponential call lengths
averaging 3 minutes.

(a) What is the probability that an arrival will have to wait more
than 10 minutes before the phone is free?

(b) What is the probability that it will take him more than 10
minutes altogether to wait for the phone and complete his call?

(c) Estimate the fraction of a day that the phone will be in use.

SOLUTION
(a) Again, we have λ = 0.1 arrival per minute
                   μ = 0.33 service per minute

$$P\{\text{waiting time} > 10\} = \int_{10}^{\infty} \left(1 - \frac{\lambda}{\mu}\right)\lambda e^{(\lambda - \mu)w}\,dw \quad \text{from equation (6.12)}$$
$$= \frac{\lambda}{\mu}\,e^{10(\lambda - \mu)} = 0.3e^{-2.3} = 0.03$$

(b) $$P\{\text{time in system} > 10\} = \int_{10}^{\infty} (\mu - \lambda) e^{(\lambda - \mu)v}\,dv \quad \text{from equation (6.14)}$$
$$= e^{10(\lambda - \mu)} = e^{-2.3} = 0.10$$

(c) The expected fraction of a day that the phone will be in use is
seen to be the same as the expected fraction of any other time interval
that the phone will be in use. In particular, it is the same as the
expected fraction of a small time interval Δt that the phone will be in
use, which is the same as the probability that the booth is occupied
at a random instant. Thus the answer is P{n > 0} = 1 - P_0 =
λ/μ = 0.3.
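
The two tail probabilities follow from integrating the densities in closed form: the waiting-time tail is (λ/μ) e^{-(μ - λ)t} and the time-in-system tail is e^{-(μ - λ)t}. A short sketch, again taking μ = 1/3 exactly and using illustrative function names:

```python
import math

def prob_wait_exceeds(lam, mu, t):
    """Integral of the waiting-time density (6.12) from t to infinity."""
    return (lam / mu) * math.exp(-(mu - lam) * t)

def prob_time_in_system_exceeds(lam, mu, t):
    """Integral of the time-in-system density (6.14) from t to infinity."""
    return math.exp(-(mu - lam) * t)

lam, mu = 0.1, 1.0 / 3.0
print(round(prob_wait_exceeds(lam, mu, 10.0), 2))            # part (a)
print(round(prob_time_in_system_exceeds(lam, mu, 10.0), 2))  # part (b)
print(round(lam / mu, 2))                                    # part (c): fraction of time in use
```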


Example 3 

A repairman is to be hired to repair machines which break down at 
an average rate of three per hour. Breakdowns are distributed in time 
in a manner that may be regarded as Poisson. Non-productive time 
on any one machine is considered to cost the company $5 per hour. 
The company has narrowed the choice down to two repairmen, one 
slow but cheap, the other fast but expensive. The slow cheap repair- 
man asks $3 per hour; in return, he will service broken-down machines 
exponentially at an average rate of four per hour. The fast expensive 
repairman demands $5 an hour, and will repair machines exponentially 
at an average rate of six per hour. Which repairman should be hired? 


SOLUTION

We shall calculate the total expected cost for an 8-hour day for each 
repairman, and shall choose that repairman for which the total ex- 
pected cost is least. 

First, consider the slow cheap repairman. His salary for the day
represents a $24 cost to the company. To compute the cost of
non-productive time, we note that broken-down machines may be regarded
as forming a queue, and the single repairman (whichever is chosen) as
a single servicing station. The expected non-productive time in an
8-hour day is

$$8 \cdot \frac{\lambda}{\mu - \lambda} \quad \text{from (6.17)}$$
$$= 8 \cdot \frac{3}{4 - 3} = 24 \text{ machine-hours}$$

Then the expected cost of non-productive time is $120; and the total
expected cost with the slow cheap repairman is $144.

For the fast expensive repairman, salary cost is $40. Non-productive
time cost is estimated as

$$8 \cdot \frac{3}{6 - 3} \cdot \$5 = \$40$$

So the total expected cost with the fast expensive repairman is $40 +
$40 = $80.

We hire the fast expensive repairman.
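
The comparison reduces to one expression per repairman: hours in the day times the sum of the hourly wage and the hourly cost of the machines expected to be out of service, E(n) = λ/(μ - λ). A minimal sketch, with an illustrative function name and the example's figures as defaults:

```python
def daily_cost(lam, mu, wage_per_hour, downtime_cost_per_hour=5.0, hours=8.0):
    """Expected daily cost: wages plus non-productive machine time, using E(n) from (6.17)."""
    machines_down = lam / (mu - lam)
    return hours * (wage_per_hour + downtime_cost_per_hour * machines_down)

slow = daily_cost(lam=3.0, mu=4.0, wage_per_hour=3.0)
fast = daily_cost(lam=3.0, mu=6.0, wage_per_hour=5.0)
print(slow, fast, "hire fast" if fast < slow else "hire slow")
```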


exponential with average unloading time 1/(2n) days. The company has
a large labor supply without regular working hours, and to avoid long

ships per day. Since the mean servicing rate is not constant in this

$$
\begin{aligned}
P_1 &= \frac{\lambda}{\mu} P_0 \\
P_n &= \frac{\lambda + (n-1)\mu}{n\mu} P_{n-1} - \frac{\lambda}{n\mu} P_{n-2}, \qquad n \ge 2
\end{aligned}
$$

(Here λ = 3 ships per day; μ = mean service rate with one unloading
crew = 2 ships per day.) By induction we find

$$P_n = \frac{1}{n!}\left(\frac{\lambda}{\mu}\right)^n e^{-\lambda/\mu}, \qquad n \ge 0$$

Then the expected number of crews working at any specified instant is

$$E(n) = \sum_{n=0}^{\infty} n P_n = \frac{\lambda}{\mu} = 1.5 \text{ crews}$$

The probability that a ship entering service will require more than four
crews is the same as the probability that there are at least five ships in
the system at any specified instant, given by

$$\sum_{n=5}^{\infty} P_n = 1 - \sum_{n=0}^{4} P_n = 0.019$$
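
The final probability can be checked directly, on the assumption that the P_n here form a Poisson distribution with parameter λ/μ = 1.5, that is, P_n = (λ/μ)^n e^{-λ/μ}/n!. The function name is illustrative.

```python
import math

def prob_at_least(n_min, load):
    """P{n >= n_min} when P_n = load^n * exp(-load) / n! (load = lam/mu)."""
    below = sum(load ** n * math.exp(-load) / math.factorial(n) for n in range(n_min))
    return 1.0 - below

load = 3.0 / 2.0   # lam = 3 ships per day, mu = 2 ships per day per crew
print(round(prob_at_least(5, load), 3))   # probability of at least five ships in the system
```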


Queues: Several Servicing Stations 


In this subsection, we give properties and examples for k servicing
stations, each with mean service rate μ, fed by a queue built up of
arrivals with mean arrival rate λ. The governing equations (6.7)
yield immediately the following expressions for the P_n in terms of P_0:

$$
\begin{aligned}
P_n &= \frac{1}{n!}\left(\frac{\lambda}{\mu}\right)^n P_0, \qquad n = 0, 1, 2, \ldots, k-1 \\
P_n &= \frac{1}{k!\,k^{n-k}}\left(\frac{\lambda}{\mu}\right)^n P_0, \qquad n \ge k
\end{aligned}
\tag{6.21}
$$

The sequence P_n is seen to be composed of two dissimilar parts (if
k > 1); and the resulting expression for P_0 is not simple. For this
reason, it will be convenient to state queue properties in terms of P_0 as
well as λ, μ, and k. In specific numerical examples, where it is usually
necessary to evaluate P_0, the following expression for P_0 may be used:

$$P_0 = \frac{1}{\left[\sum_{n=0}^{k-1} \frac{1}{n!}\left(\frac{\lambda}{\mu}\right)^n\right] + \frac{1}{k!}\left(\frac{\lambda}{\mu}\right)^k \frac{k\mu}{k\mu - \lambda}} \tag{6.22}$$

which is valid if kμ > λ. (If kμ ≤ λ, none of the results of this section
are applicable, since in that case the queue builds up indefinitely, and
stability in time is never achieved.)

We recall that, for the case k = 1, the probability that an arrival
has to wait for service is λ/μ. For the general case of k servicing
stations, the probability that an arrival has to wait is just the probability
that at any specified instant there are at least k units in the system.
From (6.21) we may compute this probability:

$$P\{n \ge k\} = \sum_{n=k}^{\infty} P_n = \frac{\mu(\lambda/\mu)^k}{(k-1)!\,(k\mu - \lambda)} P_0 \tag{6.23}$$

The formulas for average queue length, average number of units in
the system, average waiting time for service, and average time in the
system are listed below without proof.*

(6.24)  $E(m) = \frac{\lambda\mu(\lambda/\mu)^k}{(k-1)!\,(k\mu - \lambda)^2} P_0$                      average queue length
(6.25)  $E(n) = \frac{\lambda\mu(\lambda/\mu)^k}{(k-1)!\,(k\mu - \lambda)^2} P_0 + \frac{\lambda}{\mu}$   average number of units in the system
(6.26)  $E(w) = \frac{\mu(\lambda/\mu)^k}{(k-1)!\,(k\mu - \lambda)^2} P_0$                             average waiting time of an arrival
(6.27)  $E(v) = \frac{\mu(\lambda/\mu)^k}{(k-1)!\,(k\mu - \lambda)^2} P_0 + \frac{1}{\mu}$             average time an arrival spends in the system


Example 5 
An insurance company has three claims adjusters in its branch office. 
People with claims against the company are found to arrive in a Poisson 


*The derivation of these formulas is more complicated than the analogous 
formulas for the case k = 1. For the mode of proof, see Philip M. Morse, Queues, 
Inventories, and Maintenance, John Wiley & Sons, New York, 1958. 
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fashion, at an average rate of 20 per 8-hour day. The amount of time 
that an adjuster spends with a claimant is found to have an exponential 
distribution, with mean service time 40 minutes. Claimants are proc- 
essed in the order of their appearance. 

(a) How many hours a week can an adjuster expect to spend with 


claimants? ; : 
(b) How much time, on the average, does a claimant spend in the 


branch office? 


SOLUTION

(a) Here λ = 5/2 arrivals per hour and μ = 3/2 services per hour for
each adjuster, so that λ/μ = 5/3 and

$P_0 = \dfrac{1}{1 + \frac{5}{3} + \frac{1}{2}\left(\frac{5}{3}\right)^2 + \frac{1}{6}\left(\frac{5}{3}\right)^3 \frac{9}{4}} = \dfrac{24}{139}$

The expected number of idle adjusters, at any specified instant, is

$3P_0 + 2P_1 + 1P_2 = 3\left(\dfrac{24}{139}\right) + 2\left(\dfrac{40}{139}\right) + 1\left(\dfrac{100}{417}\right) = \dfrac{4}{3}$ adjusters

Then the probability that any one adjuster will be idle at any specified
time is 4/9; and the expected weekly time an adjuster spends with
claimants is (5/9)40 = 22.2 hours. Note that the same result could
be obtained, in a simpler though less instructive fashion, if we divide
the expected number of servicing minutes per week (4000) by the
number of adjusters (3).

(b) The average time an arrival spends in the system is found from
(6.27) to be 49.0 minutes.
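
As a check on Example 5, the k-station formulas can be evaluated numerically. The following is a minimal Python sketch, not part of the original text; the function name `mmk_measures` and the dictionary keys are illustrative assumptions. It evaluates P0, the probability of waiting, and the four averages for given λ, μ, and k, with rates in arrivals and services per hour.

```python
from math import factorial

def mmk_measures(lam, mu, k):
    """Steady-state measures for Poisson arrivals, exponential service, k stations."""
    if k * mu <= lam:
        raise ValueError("no steady state: need k*mu > lam")
    a = lam / mu
    # Probability that the system is empty.
    p0 = 1.0 / (sum(a**n / factorial(n) for n in range(k))
                + a**k / factorial(k) * (k * mu) / (k * mu - lam))
    # Probability that an arrival has to wait (at least k units in the system).
    p_wait = mu * a**k / (factorial(k - 1) * (k * mu - lam)) * p0
    # Average waiting time, queue length, number in system, time in system.
    e_w = mu * a**k / (factorial(k - 1) * (k * mu - lam) ** 2) * p0
    e_m = lam * e_w
    e_n = e_m + a
    e_v = e_w + 1.0 / mu
    return {"P0": p0, "P_wait": p_wait, "E_m": e_m, "E_n": e_n, "E_w": e_w, "E_v": e_v}

# Example 5: 2.5 arrivals per hour, 1.5 services per hour per adjuster, 3 adjusters.
ex5 = mmk_measures(lam=2.5, mu=1.5, k=3)
print(round(ex5["P0"], 4))         # 0.1727, i.e. 24/139
print(round(ex5["E_v"] * 60, 1))   # 49.0 minutes in the branch office
```

The expected number of busy adjusters is λ/μ = 5/3, which agrees with the 4/3 idle adjusters found in part (a).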


Example 6 
A steel company, which operates its own fleet of ships to import iron
ore, is considering the construction of port facilities to support a new
plant. Both the number of unloading berths and the type of installation
in each berth must be decided, on the basis of minimizing total
unloading costs.

A maximum of three berths can be built; and it is required that the
same type of unloading installation be in each berth that is built. The
choice of unloading installation lies among type A, type B, and type C,
for which the following information is available:


Installation    Fixed cost    Operating cost    Capacity: average tonnage
type            per day       per day           unloaded per day of operation
A               $ 840         $ 840             3600 tons
B                1350          1350             5800 tons
C                1500          1600             6400 tons


Fixed costs include such items as the amortization of the original cost
of the installation over its expected life, general maintenance, etc.;
they apply to all days, whether the equipment is used or not. Operating
costs are incurred only during the time intervals that the unloading
equipment is actually in use.

Ships to be unloaded each carry 8000 tons of ore, and are considered 
to arrive in a Poisson fashion throughout the year with a mean arrival 
rate of five ships a week. Service times for a given type of installation 
are considered to be exponential, with mean service rate corresponding 
to the average capacity column in the table. 

If time spent in the unloading system (waiting time plus unloading 
time) is considered to cost the company $2000 per ship per day, what 
type of unloading installation should be chosen, and how many berths 
should be constructed? 


SOLUTION

There are nine possible policies, each involving a choice among 
installation types (A, B, or C) and number of berths (1, 2, or 3). The 
combination (A, 1) can be discarded immediately, since the mean 
arrival rate is 5/7 = 0.714 ship per day while the mean service rate 
with the combination (A, 1) is 3600/8000 = 0.450 ship per day. The 
other eight combinations are all feasible, because they all result in a 
mean service rate greater than the mean arrival rate and hence in a 
steady state with a finite expected queue. 

The arrival rate λ for each of the eight policies remains constant at
0.714 ship per day. The service rate for a single berth will depend
only on the type of unloading installation, viz.:

μ_A = 0.450 ship per day
μ_B = 0.725 ship per day
μ_C = 0.800 ship per day


We wish to compute the total expected cost per day for each policy. 
This will be the sum of three component costs: 


1. the fixed cost of the installations. 
2. the operating cost of the installations. 
3. the cost of ship holding time. 


The fixed cost (per day) for the various policies is computed from 
the second column of the above table: 


          Number of berths
          1        2        3
A |       -      1680     2520
B |     1350     2700     4050     (Fixed cost per day, $)
C |     1500     3000     4500


The operating cost per day for the various policies may be obtained 
by first computing the fraction of the time a berth must stay in opera- 
tion in order to preserve a steady-state balance on the average between 
arrivals and services. Thus, for the policy (A, 2), we have a system 
mean service rate of 0.900 ship per day; then the fraction of time that 
an individual berth will be in operation (called the “system utilization 
factor”) is λ/0.900 = 0.714/0.900 = 0.793. Similar results for all the
feasible policies are listed below.


          Number of berths
          1        2        3
A |       -      0.793    0.529
B |     0.985    0.493    0.328     (System utilization factor)
C |     0.892    0.446    0.297


The operating cost for the various policies is then given by: 


(Number of berths) × (system utilization factor)
    × (operating cost per day for one berth)


          Number of berths
          1        2        3
A |       -      1332     1332
B |     1330     1330     1330     (Operating cost per day, $)
C |     1427     1427     1427


In order to obtain E(v), the average amount of time a ship spends
in the unloading system, we use equation (6.27) but must first compute
the eight values of P0 associated with the various policies, using
equation (6.22). The resulting values of E(v), in days, are:


          Number of berths
          1         2        3
B |     90.90     1.82     1.43     (Values of E(v), days)
C |     11.60     1.56     1.29


Multiplying by λ (the expected number of ships per day) and then by
$2000 (ship holding cost for one day), we get


          Number of berths
          1          2        3
B |     129,805    2599     2042     (Ship holding cost per day, $)
C |      16,565    2228     1842


Adding the corresponding fixed costs per day, operating costs per day,
and ship holding costs per day, we obtain the following total expected
costs per day for the various policies:


          Number of berths
          1          2        3
B |     132,485    6629     7422     (Total expected cost per day, $)
C |      19,492    6655     7769


Conclusion: Build two berths, and equip them with installation 
type B. 
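
The cost comparison of Example 6 lends itself to a short numerical check. The sketch below is an illustration only and is not taken from the original text; the names `TYPES`, `time_in_system`, and `total_cost` are assumptions. It uses the rounded arrival rate 0.714 ship per day, as the text does, and it also evaluates the type A policies. Small differences from the tabulated figures are to be expected, since the tables were built from rounded intermediate values.

```python
from math import factorial

LAM = 0.714          # ships per day, rounded as in the text
HOLDING = 2000.0     # dollars per ship per day spent in the system
# type: (fixed cost per berth-day, operating cost per berth-day, service rate per berth)
TYPES = {"A": (840.0, 840.0, 0.450),
         "B": (1350.0, 1350.0, 0.725),
         "C": (1500.0, 1600.0, 0.800)}

def time_in_system(lam, mu, k):
    """E(v) for Poisson arrivals, exponential service, and k stations."""
    a = lam / mu
    p0 = 1.0 / (sum(a**n / factorial(n) for n in range(k))
                + a**k / factorial(k) * (k * mu) / (k * mu - lam))
    return mu * a**k / (factorial(k - 1) * (k * mu - lam) ** 2) * p0 + 1.0 / mu

def total_cost(kind, berths):
    """Expected total cost per day, or None when no steady state exists."""
    fixed, operating, mu = TYPES[kind]
    if berths * mu <= LAM:
        return None
    utilization = LAM / (berths * mu)
    return (berths * fixed
            + berths * utilization * operating
            + LAM * HOLDING * time_in_system(LAM, mu, berths))

costs = {(t, k): total_cost(t, k) for t in TYPES for k in (1, 2, 3)}
feasible = {policy: c for policy, c in costs.items() if c is not None}
best = min(feasible, key=feasible.get)
print(best, round(feasible[best]))
```

Under these assumptions the minimum falls on two berths of type B, in agreement with the conclusion above.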


POISSON ARRIVALS AND ERLANG SERVICE TIMES


In this section, we retain the assumption of Poisson arrivals, but 
broaden the family of service time distributions to a new family, which 
will include as special cases both the exponential distribution and the 
situation where service time is constant. Because of the complexity of 
the analysis, the discussion of this section will be confined to the case 
of a single servicing station. 


The Erlang Family of Service Time Distributions 


The exponential assumption for service times, as specified by (6.5), 
gives only a one-parameter family (the parameter is u) of possible 
service time distributions. This assumption is found to be unduly 
restrictive, partly because the exponential distribution has the property 
that larger service times are inherently less probable than smaller 
service times. A two-parameter generalization of the exponential 
family, called the Erlang* family of service time distributions, is highly
useful in waiting-line applications. The Erlang family g(t; μ, k) is
defined by

(6.28)
$g(t; \mu, 1) = C_1\, e^{-\mu t}$
$g(t; \mu, 2) = C_2\, t\, e^{-2\mu t}$
$g(t; \mu, 3) = C_3\, t^2 e^{-3\mu t}$

and generally

$g(t; \mu, k) = C_k\, t^{k-1} e^{-k\mu t}$

Since each member of the family is a probability density function on
the range 0 < t < ∞, the constants C_k are assigned so that the integral
of the corresponding function over this range is unity. Values of the
C_k are:

(6.29)
$C_1 = \mu$
$C_2 = 4\mu^2$
$C_3 = \dfrac{27}{2}\,\mu^3$

and generally

$C_k = \dfrac{(\mu k)^k}{(k-1)!}$

* Named for A. K. Erlang, who did pioneer work in the field of stochastic
processes in the early part of the century.


The Erlang family has many interesting properties. The members
all share the common mean 1/μ. The mode is at t = 0 for k = 1; it is
at t = 1/(2μ) for k = 2; for general k, the mode is located at

$t = \dfrac{k-1}{\mu k}$

The variance for the kth member of the family is 1/kμ². We see that
the one-parameter family obtained by setting k = 1 is just the exponential
family; as k increases, the mode moves to the right toward 1/μ,
and the variance decreases toward zero. For k = ∞, the mode is at
1/μ, and the variance is zero; so we may interpret g(t; μ, ∞) as the
family of situations for which the service time is constant and has the
value 1/μ.

[Figure 6.2. The Erlang family of service time distributions. Horizontal
axis: service time; vertical axis: probability density.]

Figure 6.2 gives a suggestion of the way the density functions vary
as k increases.

The Erlang family has a close tie with the exponential distribution, 
even beyond the fact that the first Erlang distribution is exponential. 
If we have r random variables x_1, x_2, ..., x_r, which are independent and
have a common exponential distribution with mean 1/rμ, then the
random variable x_1 + x_2 + ... + x_r follows the rth Erlang distribution
with parameter μ. Thus a servicing station in which a unit goes through
r independent service phases, each exponential with mean time 1/rμ,
will have service times obeying the rth Erlang distribution with
parameter μ.

Even when this physical realization of the Erlang distribution is not 
present, some member of the family may give an adequate fit to the 
observed distribution of service times. 
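
The stated properties of the Erlang family (unit area, mean 1/μ, variance 1/kμ², and mode (k − 1)/μk) can be checked numerically. The following Python sketch is an illustration under assumed values μ = 2 and k = 3, which are not from the text; it integrates the density by the trapezoidal rule, and the function names are hypothetical.

```python
from math import exp, factorial

def erlang_density(t, mu, k):
    """g(t; mu, k) = C_k * t**(k-1) * exp(-k*mu*t), with C_k = (mu*k)**k / (k-1)!."""
    c_k = (mu * k) ** k / factorial(k - 1)
    return c_k * t ** (k - 1) * exp(-k * mu * t)

def moments(mu, k, steps=200_000):
    """Trapezoidal estimates of total probability, mean, and variance."""
    upper = 40.0 / mu              # far enough out that the tail is negligible
    h = upper / steps
    total = mean = second = 0.0
    for i in range(steps + 1):
        t = i * h
        weight = 0.5 if i in (0, steps) else 1.0
        g = erlang_density(t, mu, k) * weight * h
        total += g
        mean += t * g
        second += t * t * g
    return total, mean, second - mean ** 2

mu, k = 2.0, 3                     # assumed illustrative values
total, mean, variance = moments(mu, k)
mode = (k - 1) / (mu * k)
print(round(total, 6), round(mean, 6), round(variance, 6), round(mode, 6))
```

For these values the mean should come out near 0.5, the variance near 1/12, and the density should peak at t = 1/3.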


Queues: One Servicing Station 


The derivation of queue properties for the case of Erlang services 
is based on the use of state probabilities, where an individual state is 
defined as the number in the system together with the current servicing 
phase of the unit presently in service, if any. The results, given here 
without proof, are for the case of Poisson arrivals with mean arrival 
rate λ, and service times following the rth Erlang distribution with
mean service rate μ. The reader can verify that these results specialize
to the results for the exponential case, when r = 1.


(6.30)  $E(m) = \dfrac{r+1}{2r} \cdot \dfrac{\lambda^2}{\mu(\mu - \lambda)}$    average queue length

(6.31)  $E(n) = \dfrac{r+1}{2r} \cdot \dfrac{\lambda^2}{\mu(\mu - \lambda)} + \dfrac{\lambda}{\mu}$    average number of units in system

(6.32)  $E(w) = \dfrac{r+1}{2r} \cdot \dfrac{\lambda}{\mu(\mu - \lambda)}$    average waiting time

(6.33)  $E(v) = \dfrac{r+1}{2r} \cdot \dfrac{\lambda}{\mu(\mu - \lambda)} + \dfrac{1}{\mu}$    average time in system

Example 7
A barber with a one-man shop takes exactly 25 minutes to complete
one haircut. If customers arrive in a Poisson fashion at an average
rate of one every 40 minutes, how long on the average must a customer
wait for service?


SOLUTION
For the case of constant service time, r = ∞, and equation (6.32)
reduces to

$E(w) = \dfrac{\lambda}{2\mu(\mu - \lambda)}$

Here λ = 0.025 customer per minute, and μ = 0.04 customer per
minute.

$E(w) = \dfrac{0.025}{(0.08)(0.015)}$

= 20.8 minutes


Example 8 

Repairing a certain type of machine which breaks down in a given 
factory consists of five basic steps that must be performed sequentially. 
The time taken to perform each of the five steps is found to have an 
exponential distribution with mean 5 minutes and is independent of 
the other steps. If these machines break down in a Poisson fashion at 
an average rate of two per hour, and if there is only one repairman, 
what is the average idle time for each machine that has broken down? 


SOLUTION
According to the results stated in the early part of this section,
service times will follow an Erlang distribution with r = 5 and μ = 0.04
service per minute. The expected time out of operation for a machine
that has broken down is given by equation (6.33) with λ = 0.033 arrival
per minute. Thus

$E(v) = \dfrac{6}{10} \cdot \dfrac{0.033}{(0.04)(0.007)} + \dfrac{1}{0.04}$

= 96 minutes
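
Equations (6.30) to (6.33) are simple enough to evaluate directly. The sketch below is illustrative only; the function name `erlang_queue` and its return keys are assumptions. It treats constant service time as the limit r = ∞, where the factor (r + 1)/2r becomes 1/2, and applies the formulas to Examples 7 and 8. Note that the result of Example 8 is sensitive to rounding: with λ = 0.033 and μ − λ = 0.007 the formula gives about 96 minutes, while carrying λ = 1/30 exactly gives 100 minutes.

```python
from math import inf

def erlang_queue(lam, mu, r):
    """Single station, Poisson arrivals, r-th Erlang service times."""
    if lam >= mu:
        raise ValueError("no steady state: need mu > lam")
    # (r + 1) / 2r tends to 1/2 as r grows; r = inf means constant service time.
    factor = 0.5 if r == inf else (r + 1) / (2 * r)
    e_w = factor * lam / (mu * (mu - lam))   # average waiting time
    e_m = lam * e_w                          # average queue length
    e_n = e_m + lam / mu                     # average number in system
    e_v = e_w + 1.0 / mu                     # average time in system
    return {"E_m": e_m, "E_n": e_n, "E_w": e_w, "E_v": e_v}

# Example 7: constant 25-minute haircuts, one arrival every 40 minutes.
barber = erlang_queue(lam=1 / 40, mu=1 / 25, r=inf)
print(round(barber["E_w"], 1))       # 20.8 minutes

# Example 8 with the arrival rate carried exactly: five 5-minute phases,
# two breakdowns per hour.
repair = erlang_queue(lam=2 / 60, mu=1 / 25, r=5)
print(round(repair["E_v"], 1))       # 100.0 minutes
```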


MONTE CARLO SOLUTIONS 


The Monte Carlo methods discussed in Chapter Three are extremely 
useful in waiting-line problems that are difficult or impossible to
analyze mathematically. Simulated sampling methods are appropriate,
for example, when the simple first-in first-out assumption of this
chapter is replaced by a different type of queue discipline. In many 
cases, the observed distributions of service times and time between 
arrivals cannot be fitted by mathematical distributions, and the Monte 
Carlo approach is the only hope for any sort of answer. Multistage
queueing, in which the departures from one queue form the arrivals 
for the next, is another difficult area that can be handled comparatively 
simply by Monte Carlo. 

There are several advantages of the Monte Carlo approach over the 
obvious sampling procedure of just looking at the actual process and 
forming a history of arrivals, services, waiting times, queue lengths, 
etc. For one thing, simulated sampling, especially when done on a 
digital computer, can develop months or years of “data” in a matter 
of a few minutes. Another advantage is that manipulation can be 
made of those factors that are subject to control. For example, using 
Monte Carlo methods, we can readily assess the effect of adding one 
or more service facilities without actually having to go to the trouble 
and expense of installing them. Changes in queue discipline can be 
tried out experimentally on paper, without any disruption whatever 
of the actual process itself, and so on. 

The following examples are merely illustrative of the application 
of Monte Carlo methods to queueing. They should not be taken as
models to follow in practice, both because the situations described are
perhaps oversimplified, and because far too few arrivals and services
are investigated.


Example 9 

Buses are scheduled to pass a certain corner every 15 minutes, but 
actually the arrival of a bus varies normally about its scheduled arrival 
time, with a standard deviation of 3 minutes. People wishing to board 
the bus arrive in a Poisson fashion with mean arrival rate of 4 persons 
per hour, and the number of empty seats on the bus has a Poisson 
distribution with mean 3/2. If no standees are permitted, find the
average waiting time of an arrival.


SOLUTION
The period investigated was taken to be approximately 8 hours.
The columns in Table 1 (I-VI) are labeled in the order in which they
are obtained. Columns I and II refer to bus arrival times and occupancy,
and were obtained by Monte Carlo from the given distributions.
Column III refers to the time interval between the arrival of one person
and the next at the bus stop and was obtained by Monte Carlo from
the exponential distribution e^(-t/15). All entries are in minutes, as
measured from the starting time of the simulated sampling procedure.


TABLE 1.
[Table entries not recoverable. Surviving column headings:
I. Buses, number of empty seats (Monte Carlo);
II. Buses, time of arrival (Monte Carlo);
III. People, time between arrivals (Monte Carlo);
IV. People, time of arrival (from III);
VI. People, waiting time (from II, IV, V).]


Once the arrival times of people and buses are set down, the assign- 
ment of people to buses becomes automatic. The obvious principle 
is that a person is assigned to a bus if he is waiting when the bus comes 
and there is a seat available for him. Once a person is assigned to a 
bus, his waiting time (col. VI) is obtained by subtracting his arrival 
time from the arrival time of his bus. 

Waiting times for 21 arrivals were obtained, totaling 221 minutes.
Thus our estimate of the mean waiting time of an arrival is 10.5 
minutes. 


Example 10 

This example involves what is known as a three-stage queueing 
process. Each arriving unit proceeds in its turn through three servicing
stations, in a prescribed order. Arrivals are Poisson, and occur every
20 minutes on the average. Service times in the three stages are
distributed in the following fashion:


Stage 1: Normal, mean 10 minutes, S.D. 5 minutes.
Stage 2: Exponential, with μ = 1/15 service per minute.
Stage 3: Service time constant at 15 minutes.


Using Monte Carlo methods, find the expected time in the system
and the expected time spent waiting, for a typical arrival.


SOLUTION
We investigate the history of 36 arrivals, over a time interval that
turns out to be of the order of 15 hours. To obtain representative
time intervals between arrivals, we work with the cumulative
distribution e^(-0.05T), and obtain

T = 20 ln (1/y)

where y is a random number between 0 and 1. To obtain representative
service times for the first stage, we use tables of random normal numbers
as explained in Chapter Three. For the second service stage, we
work with the cumulative distribution e^(-T/15). The results obtained
for the 36 units considered are given in Table 2.
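
The rule T = 20 ln (1/y) can be tried out on a computer in place of a table of random numbers. The sketch below is only an illustration; the function name `interarrival`, the seed, and the sample size are assumptions. It draws y uniformly from (0, 1] and confirms that the sample mean of T is close to the intended 20 minutes. The same function with `mean=15.0` would serve for the second service stage.

```python
import math
import random

def interarrival(rng, mean=20.0):
    """Return T = mean * ln(1/y) for a uniform random y in (0, 1]."""
    y = 1.0 - rng.random()          # random() lies in [0, 1), so y lies in (0, 1]
    return mean * math.log(1.0 / y)

rng = random.Random(1)              # fixed seed, only to make the sketch repeatable
samples = [interarrival(rng) for _ in range(100_000)]
sample_mean = sum(samples) / len(samples)
print(round(sample_mean, 1))        # should be close to 20 minutes
```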

These numbers are translated into the actual diary of an arrival, in
Table 3, by the usual considerations. A unit proceeds immediately
into a service stage if it is unoccupied; otherwise, it waits until the
servicing station is free. All times are in minutes, measured from the arrival
time of the item immediately preceding arrival 1.


TABLE 2. MONTE CARLO SOLUTION TO EXAMPLE 10

           Arrival time measured    Service time, first    Service time, second
Arrival    from previous arrival    service stage          service stage
   1               25                      11                      9
   2                7                      13                      4
   3               10                       6                     12
   4               31                       8                     26
   5               21                       5                     13
   6               12                      11                      7
   7               36                       9                     18
   8               32                      15                     50
   9                7                      18                      2
  10                1                       6                      5
  11               32                       9                     24
  12               18                       3                     15
  13               12                       1                      3
  14               11                       6                      3
  15               68                      10                      5
  16               27                       6                      8
  17               21                       6                      1
  18               13                      15                      2
  19                2                      11                     31
  20                7                      16                      3
  21               41                       7                      6
  22                7                      18                      5
  23               37                      15                      8
  24               26                      12                     10
  25               12                       2                     23
  26                2                       8                      3
  27               60                      14                     33
  28                2                      11                      6
  29               60                       7                      3
  30                2                      12                     45
  31               27                       6                      1
  32               37                       3                      9
  33               22                      14                     22
  34               27                      10                     79
  35               10                      13                     42
  36               20                      11                     35

TABLE 3.

                    Time     Time     Time     Time     Time     Time
                    into     out of   into     out of   into     out of
         Arrival    first    first    second   second   third    third    Time in
Arrival  time       stage    stage    stage    stage    stage    stage    system
   1       25         25       36       36       45       45       60       35
   2       32         36       49       49       53       60       75       43
   3       42         49       55       55       67       75       90       48
   4       73         73       81       81      107      107      122       49
   5       94         94       99      107      120      122      137       43
   6      106        106      117      120      127      137      152       46
   7      142        142      151      151      169      169      184       42
   8      178        178      193      193      243      243      258       70
   9      185        193      211      243      245      258      273       88
  10      186        211      217      245      250      273      288      102
  11      218        218      227      250      274      288      303       85
  12      236        236      239      274      289      303      318       82
  13      248        248      249      289      292      318      333       85
  14      259        259      265      292      295      333      348       89
  15      327        327      337      337      342      348      363       36
  16      354        354      360      360      368      368      383       29
  17      375        375      381      381      382      383      398       23
  18      388        388      403      403      405      405      420       32
  19      390        403      414      414      445      445      460       70
  20      397        414      430      445      448      460      475       78
  21      404        430      437      448      454      475      490       86
  22      445        445      463      463      468      490      505       60
  23      452        463      478      478      486      505      520       68
  24      478        478      490      490      500      520      535       57
  25      490        490      492      500      523      535      550       60
  26      492        492      500      523      526      550      565       73
  27      552        552      566      566      599      599      614       62
  28      554        566      577      599      605      614      629       75
  29      614        614      621      621      624      629      644       30
  30      616        621      633      633      678      678      693       77
  31      643        643      649      678      679      693      708       65
  32      681        681      684      684      693      708      723       42
  33      703        703      717      717      739      739      754       51
  34      730        730      740      740      819      819      834      104
  35      740        740      753      819      861      861      876      136
  36      760        760      771      861      896      896      911      151

                                                               Total      2372


Average time in system = 2372/36 = 65.9 minutes

Average waiting time = 65.9 - 10 - 15 - 15 = 25.9 minutes


PROBLEMS FOR SOLUTION
Set VI


1. A TV repairman finds that the time spent on his jobs has an exponential
distribution with mean 30 minutes. If he repairs sets in the order in which
they come in, and if the arrival of sets is approximately Poisson with an
average rate of 10 per 8-hour day, what is the repairman’s expected idle time
each day? How many jobs are ahead of the average set just brought in?

[Ans. 3 hours; 1 2/3 jobs]


2. At what average rate must a clerk at a supermarket work in order to 
insure a probability of 0.90 that the customer will not have to wait longer 
than 12 minutes? It is assumed that there is only one counter, to which cus- 
tomers arrive in a Poisson fashion at an average rate of 15 per hour. The 
length of service by the clerk has an exponential distribution. 

[Ans. 2.48 minutes per service] 


3. A supermarket has two girls ringing up sales at the counters. If the 
service time for each customer is exponential with mean 4 minutes, and if 
people arrive in a Poisson fashion at the counter at the rate of 10 an hour: 


(a) What is the probability of having to wait for service? [Ans. 0.167] 
(b) What is the expected percentage of idle time for each girl? [Ans. 67%] 


4. A telephone exchange has two long-distance operators. The telephone 
company finds that, during the peak load, long-distance calls arrive in a Poisson 
fashion at an average rate of 15 per hour. The length of service on these calls 
is approximately exponentially distributed with mean length 5 minutes. What 
is the probability that a subscriber will have to wait for his long-distance call 
during the peak hours of the day? If the subscribers will wait and are serviced 
in turn, what is the expected waiting time? [Ans. 0.48, 3.2 minutes] 


5. A bank has two tellers working on savings accounts. The first teller 
handles withdrawals only. The second teller handles deposits only. It has 
been found that the service time distributions for both deposits and
withdrawals are exponential with mean service time 3 minutes per customer.
Depositors are found to arrive in a Poisson fashion throughout the day with mean
arrival rate 16 per hour. Withdrawers also arrive in a Poisson fashion with 
mean arrival rate 14 per hour. What would be the effect on the average waiting 
time for depositors and withdrawers if each teller could handle both with- 
drawals and deposits? What would be the effect if this could only be accom- 
plished by increasing the mean service time to 3.5 minutes? 

[Ans. Initially 12 minutes and 7 minutes. Combined waiting 
time 3.86 minutes. Combined waiting time with increased service 
time 11.4 minutes.] 


6. At a certain airport it takes exactly 5 minutes to land an airplane, once it is
given the signal to land. Although incoming planes have scheduled arrival
times, the wide variability in arrival times produces an effect which makes
the incoming planes appear to arrive in a Poisson fashion at an average rate
of six per hour. This produces occasional stack-ups at the airport which can
be dangerous and costly. Under these circumstances, how much time will a
pilot expect to spend circling the field waiting to land? [Ans. 2 1/2 minutes]


7. Problems arrive at a computing center in a Poisson fashion at an average 
rate of five per day. The rules of the computing center are that any man wait- 
ing to get his problem solved must aid the man whose problem is being solved. 
If the time to solve a problem with one man has an exponential distribution 
with a mean time of 1/3 day, and if the average solving time is inversely propor- 
tional to the number of people working on the problem, approximate the ex- 
pected time in the center for a person entering the line. [Ans. 8 hours] 


8. A warehouse in a small state receives orders for a certain item and sends 
them by truck as soon as possible to the customer. The orders arrive in a 
Poisson fashion at a mean rate of 0.9 per day. Only one item at a time can 
be shipped by truck from the warehouse, which is located in the central part 
of the state. Because the customers are located in various places in the state, 
the distribution of service time in days has a distribution with probability 
density 4te^(-2t). What is the expected delay between the arrival of an order
and the arrival of the item to the customer? Service time is defined here as 
the time the truck takes to load, get to the customer, unload, and return to 
the warehouse. Loading and unloading times are small compared with travel 
time. [Ans. 7 3/4 days]


9. A hospital clinic has a doctor examining every patient brought in for a 
general checkup. The doctor averages 4 minutes on each phase of the checkup 
although the distribution of time spent on each phase is approximately ex- 
ponential. If each patient goes through four phases in the checkup and, if 
the arrivals of the patients to the doctor’s office are approximately Poisson at 
an average rate of three per hour, what is the average time spent by a patient 
waiting in the doctor’s office? What is the average time spent in the examina- 
tion? What is the most probable time spent in the examination? 

[Ans. 40 minutes, 16 minutes, 12 minutes] 


10. Attendants at a gas station are paid at the rate of $15 each per 8-hour
day. Cars arrive for service in a Poisson fashion at an average rate of one
car every 3 minutes. Each car is serviced by a single attendant, and the
resultant profit per car serviced is estimated to average $1.00. The service
time per car is exponentially distributed with mean 8 minutes. When the
number of cars waiting for service exceeds two, arrivals become discouraged
and drive on (lost business). Use Monte Carlo methods to estimate the
optimum number of attendants.


11. Before parts are assembled into a vacuum tube, they must be cleaned
in a degreaser. Batches of parts are brought in randomly (Poisson) at an
average rate of λ batches per hour, and are cleaned at an average rate of μ
batches per hour. The cost of waiting is C1 dollars per batch per hour; and the
cost of owning and operating a degreaser that works at an average rate μ is
μC2 dollars per hour. How large should μ be?

[Ans. For least expected hourly cost, μ = λ + √(C1λ/C2)]


12. Devise a time line (or set of time lines) which will represent visually the
history of the first 10 arrivals in the three-stage queueing problem of Example
10 in the text.


13. Prove the result given in equation (6.2), which states in effect that, if 
arrivals occur at random in time, then the number of arrivals occurring in a 
fixed time interval follows a Poisson distribution. [Hint: Divide the interval 
T into N subintervals of length Δt. The probability of n arrivals is seen to
follow a binomial distribution with parameter λ Δt. What happens to this
binomial distribution as N tends to infinity?]


14. Following the hint given in the text, derive equation (6.12), which gives
the unconditional distribution of waiting time for a Poisson situation with one 
servicing station. 


15. Prove that, for the Erlang distribution with parameters μ and k, the
mode is at (1 - 1/k)(1/μ), the mean is 1/μ, and the variance is 1/kμ².


Competitive Strategies


Competitive situations are characterized by the fact that two or 
more individuals are making decisions in situations that involve con- 
flicting interests and in which the outcome is controlled by the decisions 
of all the parties involved. Many conflict situations of this type are 
found in economic, social, political, or military problems. 

Many of these situations will involve elements of chance, as for 
example the weather during a military attack, or the cards dealt in a 
poker hand. However, in all these competitive situations, one may 
assume that each opponent is going to act in some rational manner 
and will attempt to resolve the conflict of interests in his favor. 

The approach to competitive problems developed by von Neumann 
utilizes the minimax principle which has as its fundamental idea the 
minimization of maximum loss. Von Neumann’s ideas have led to the 
development of a branch of mathematics known as game theory which 
is useful in handling many competitive situations. However, not all 
the competitive situations that may arise can be analyzed with the 
mathematics of game theory. 

In this chapter we shall examine a number of competitive problems, 
and attempt to develop methods for optimizing the strategy of a
competitor.


COMPETITIVE GAMES


We shall call a competitive situation a competitive game if it has the 
following four properties: 


(a) There are a finite number of competitors. 

(b) Each of the N competitors has available to him a finite list of 
possible courses of action; this list need not be the same for each com- 
petitor. 

(c) A play of the game results when each of the competitors chooses 
a single course of action from the list of courses available to him. The 
choices are assumed to be made simultaneously, so that no competitor 
knows his opponents’ choices until he is already committed to his own. 

(d) The outcome of a play consists of the particular set of courses of 
action undertaken by the competitors. Each outcome determines a 
set of payments (positive, negative, or zero), one to each competitor. 


In a game where there are N competitors, and competitor i has
n_i courses of action available to him, there are n_1 n_2 ··· n_N possible
outcomes to a play of the game. A particular outcome θ results in a
payment p(i, θ) to competitor i. If for every possible outcome θ, we
have

$\displaystyle\sum_{i=1}^{N} p(i, \theta) = 0$

then the game is called a zero-sum game. That is, a game is a zero-sum
game if the sum of payments to all competitors after a play of the
game is restricted to be zero.

The strategy of competitor i is the decision rule he uses for making
the choice from his list of courses of action. This decision rule should
not require definite information about the opponents’ choice, inasmuch
as all choices are understood to be made simultaneously and divulged
at the same instant. Here we restrict our attention to two types of
strategy. A pure strategy is a decision, in advance of all plays, always
to choose a particular course of action. A mixed strategy is a
decision, in advance of all plays, to choose a course of action for each 
play in accordance with some particular probability distribution. For 
example, a person with two possible courses of action might flip a 
coin before each play to decide which course he should take. The 
advantage of a mixed strategy over a pure strategy, after the pattern 
of play has become evident, is that the opponents are kept guessing 
as to what a player’s actual course of action will be. 

A pure strategy may be identified by a number representing the 
course of action chosen. A mixed strategy, for a player with m possible
courses of action, is denoted by the set X of m non-negative numbers
whose sum is unity, representing the probabilities with which each
course of action is chosen. If x_i is the probability of choosing course i,
we have

$X = (x_1, x_2, \cdots, x_m)$

where

$x_i \ge 0 \quad (i = 1, 2, \cdots, m)$

and

$\displaystyle\sum_{i=1}^{m} x_i = 1$

It will be realized that a pure strategy is a special case of a mixed 
strategy, where all but one of the x_i are zero, and the exception has
the value one. A competitor may choose from only m pure strategies, 
but he has an infinite number of mixed strategies to choose from. 

Before we can attempt to find an optimal strategy for a competitor, 
we must have a criterion of optimality. Throughout this section, we 
shall use what is known as the maximin criterion. The exact meaning 
of this criterion will become more clear as we proceed. Roughly, it 
may be visualized as follows. A player lists each of his potential (mixed 
or pure) strategies together with the worst outcome, from his point of 
view, that can result from combinations of his competitors’ potential 
strategies. He chooses the strategy that corresponds to the best of 
these worst possible outcomes. 

We shall say that a game has a solution if the set of optimal strategies 
for the various players has the following property. If the players adopt 
these strategies initially, and after repeated plays gain knowledge of 
the others’ chosen strategies, they will still see fit to maintain their 
initial strategies as giving them the maximum expected gain. The 
value of a solvable game to a player is his expected gain in one play of 
the game with all players using their stable optimal strategies. 

In this chapter, we shall confine our attention to two-person zero-sum 
games. For a discussion of n-person games, the reader is referred to 
one of the several textbooks on the subject of game theory. 


Two-Person Zero-Sum Games 

A game with only two competitors A and B is most conveniently
described by means of a pair of matrices. Row designations for each
matrix are the courses of action available to A; column designations
are the courses available to B; cell entries are the corresponding
payments to A for the one matrix and to B for the other matrix.
With a zero-sum two-person game, the cell entry in B’s payoff matrix
will be the negative of the corresponding cell entry in A’s payoff
matrix; and the matrices will appear as follows (rows are A’s courses
of action, columns are B’s):

A’s payoff matrix

$\begin{array}{c|cccc}
 & 1 & 2 & \cdots & n \\ \hline
1 & a_{11} & a_{12} & \cdots & a_{1n} \\
2 & a_{21} & a_{22} & \cdots & a_{2n} \\
\vdots & \vdots & \vdots & & \vdots \\
m & a_{m1} & a_{m2} & \cdots & a_{mn}
\end{array}$

B’s payoff matrix

$\begin{array}{c|cccc}
 & 1 & 2 & \cdots & n \\ \hline
1 & -a_{11} & -a_{12} & \cdots & -a_{1n} \\
2 & -a_{21} & -a_{22} & \cdots & -a_{2n} \\
\vdots & \vdots & \vdots & & \vdots \\
m & -a_{m1} & -a_{m2} & \cdots & -a_{mn}
\end{array}$


For brevity, we shall usually omit B’s payoff matrix, with the under- 
standing that it is just the negative of the payoff matrix displayed for 
A. 

We shall see later that all two-person zero-sum games are solvable;
i.e., that stable optimal strategies exist in the sense above. The
simplest type of game to solve is one where the stable optimal strategies
are pure strategies. This is the case if and only if the payoff matrix
contains a saddle point; a saddle point is an element of a matrix that is
both the lowest element in its row and the highest element in its
column. Equivalently, a saddle point can be defined as an element
of a matrix that is at once the largest of the row minima and the smallest
of the column maxima.

The solution in such cases is for player A to use his pure strategy 
corresponding to the row through the saddle point, and for B to use his 
pure strategy corresponding to the column through the saddle point. 
The value of the game to A is the element at the saddle point, and the 
value to B is its negative. We shall not prove these statements here; 
however, the proof follows the discussion of Example 2 where optimal 
mixed strategies for a two-person game are obtained. Since it is so 
easy to discover whether or not a given matrix contains a saddle point, 
the first step in the solution of any two-person zero-sum game is to 
examine the matrix to see whether a saddle point exists.
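The saddle-point test described above can be sketched in a few lines of Python. The function name `saddle_points` and the 3-by-3 matrix are hypothetical, introduced only for illustration; rows and columns are numbered from zero.

```python
def saddle_points(matrix):
    """Return (row, column, value) for every saddle point of A's payoff matrix."""
    row_minima = [min(row) for row in matrix]
    column_maxima = [max(column) for column in zip(*matrix)]
    return [
        (i, j, value)
        for i, row in enumerate(matrix)
        for j, value in enumerate(row)
        # A saddle point is the lowest entry in its row and the highest in its column.
        if value == row_minima[i] and value == column_maxima[j]
    ]


# Hypothetical payoff matrix, chosen only to illustrate the test.
example = [
    [3, 1, 4],
    [6, 5, 7],
    [2, 0, 8],
]
print(saddle_points(example))
```

An empty list means that no pure-strategy solution exists and mixed strategies are needed.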


Example 1 
The payoff matrix of a game is given below. Find the best strategy
for each player, and the value of a play of the game to A and B.

[Payoff matrix for A not recoverable from the source; A's strategies are the rows I to IV and B's strategies are the columns I to V.]


SOLUTION
If we circle the row minima and put squares around the column
maxima, we obtain

[Annotated payoff matrix not recoverable from the source.]


We see that the matrix has a saddle point of 4, in the second row and 
third column. According to the rule, the optimal strategies are pure 
strategies, viz.: 


Player A: Strategy II. 
Player B: Strategy III. 


The value of the game is 4 to A, -4 to B. Note that these strategies
are truly stable in the minimax sense. When B becomes aware that A
is always playing II, he will maintain strategy III because it cuts his
loss to the lowest possible value. Similarly, A will continue with II
even after discovering that B is sticking to III, since strategy II gives
him the greatest possible gain.


If there is no saddle point, the best strategies are mixed strategies, 
and the solution of the game consists of evaluating the probabilities 
with which each pure strategy should be used. If mixed strategies are 
used, we cannot guarantee a minimum gain at any particular play. 
Instead we endeavor to insure that the minimum possible value of the 
average or expected gain shall be as large as possible.

In Example 2 we will see how the above rule fails when there is no
saddle point, and why in these circumstances a mixed strategy is 
inevitable. 


Example 2

The payoff matrix for A in a two-person zero-sum game is given
below. Determine the value of the game and the optimum strategies
for both players.

                 B
            I    II   III
       I   -1     2     1
A     II    1    -2     2
     III    3     4    -3


SOLUTION
If A is looking for a pure strategy he examines the rows and discovers
the row minima to be

Row          I    II   III
Minimum     -1    -2    -3


He therefore decides to use I, insuring himself of a gain of at least —1. 
By similar reasoning B decides to use his third strategy, thus insuring 
that he cannot lose more than 2.

Now suppose A and B start playing with these pure strategies. As 
soon as A discovers that B always plays III, he realizes that he (A) can 
increase his gains from 1 to 2 by playing II. When B realizes A is 
playing II, he will change his strategy to II so that he (B) can gain 2 
at each play. When A sees this, he will change to III, and so on. The
strategies will never become fixed, because, whenever one player dis- 
covers the current pure strategy of his opponent, he finds that a change 
in his own pure strategy will increase his gains. In contrast, when a 
saddle point exists, if one player changes from his pure strategy through 
the saddle point while the other player maintains his strategy, the 
player who changes will never increase his gains. Of course even with 
a saddle point, if either player is foolish, and consistently chooses the 
wrong strategy, the other player can use this to his advantage. How- 
ever, we are assuming that the players behave rationally and that both 
use the strategy that is optimal from their own viewpoint. 
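The cycling described above can be traced with a short sketch. It assumes the payoff matrix implied by the coefficients of (7.3) and (7.4); the function names are hypothetical, and strategies I, II, III are numbered 0, 1, 2.

```python
# A's payoff matrix for Example 2, as implied by the coefficients in (7.3) and (7.4).
PAYOFF = [
    [-1, 2, 1],
    [1, -2, 2],
    [3, 4, -3],
]


def best_row(matrix, column):
    """A's best pure reply to a known column: the row with the largest entry."""
    return max(range(len(matrix)), key=lambda i: matrix[i][column])


def best_column(matrix, row):
    """B's best pure reply to a known row: the column with the smallest entry."""
    return min(range(len(matrix[row])), key=lambda j: matrix[row][j])


def best_response_path(matrix, row, column, rounds):
    """Alternate best replies, starting from the pair (row, column)."""
    path = [(row, column)]
    for _ in range(rounds):
        row = best_row(matrix, column)      # A reacts to B's current column
        path.append((row, column))
        column = best_column(matrix, row)   # B reacts to A's new row
        path.append((row, column))
    return path


# Start from the maximin pair: A plays I (index 0), B plays III (index 2).
print(best_response_path(PAYOFF, 0, 2, 3))
```

The path never settles on a single pair, which is the behavior the text attributes to a game without a saddle point.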

Returning to our problem with no saddle point, it is clear that 
neither A nor B will use a single pure strategy. It is also clear that, if
one player ever realizes that the other player is using a fixed sequence
of pure strategies, he will be able to anticipate the next play of his op- 
ponent and take advantage of the situation. A good way to avoid a 
fixed sequence is to make the strategy for each play a random variable: 
i.e., to use a mixed strategy. 

Let $X = (x_1, x_2, x_3)$ and $Y = (y_1, y_2, y_3)$ denote the optimum mixed
strategies for A and B, respectively. Then the probability that A
plays $i$ is $x_i$, and the probability that B plays $j$ is $y_j$. The gain to A
at each play is now a random variable $a$, and the expected value of a
play to A is

$$E(a; X, Y) = \sum_{i} \sum_{j} a_{ij} x_i y_j$$

where $a_{ij}$ is the gain to A when A plays $i$ and B plays $j$.

A wishes to choose $X$ so that, regardless of the nature of $Y$, his
expectation at each play is at least some amount $v_1$; $v_1$ is to be as large
as possible.

Similarly the expected gain for B is

$$E(-a; X, Y) = \sum_{i} \sum_{j} (-a_{ij}) x_i y_j$$

and B will play so that the expected value of his gain is at least some
number $v_2$; $v_2$ is to be as large as possible.

Using $X_0$ and $Y_0$ to denote the optimum strategies for A and B
respectively, we require $X_0$, $Y_0$, $v_1$, $v_2$ to satisfy the conditions

$$E(a; X_0, Y) \ge v_1 \quad \text{for all } Y$$
and
$$E(-a; X, Y_0) \ge v_2 \quad \text{for all } X$$

There is a remarkable theorem due to von Neumann which states
that in any zero-sum two-person game the quantities $v_1$, $v_2$ introduced
above are the negatives of each other. We will write $v$ for $v_1$ and $-v$
for $v_2$.

Then we have
$$E(a; X_0, Y) \ge v \quad \text{for all } Y \tag{7.1}$$
$$E(a; X, Y_0) \le v \quad \text{for all } X \tag{7.2}$$
Hence

$$E(a; X_0, Y_0) = v$$


This means that, if both players use their optimal minimax strategies,
their achieved expected gains coincide with their minimax expected
gains.

Substituting the numerical values of $a_{ij}$ in (7.1) yields

$$y_1(-x_1 + x_2 + 3x_3) + y_2(2x_1 - 2x_2 + 4x_3) + y_3(x_1 + 2x_2 - 3x_3) \ge v$$

Since this is to be true of all $y_j$, it is true when any two of the $y_j$ are
zero and the third is one. The three implied inequalities are:

$$
\begin{aligned}
-x_1 + x_2 + 3x_3 &\ge v \\
2x_1 - 2x_2 + 4x_3 &\ge v \\
x_1 + 2x_2 - 3x_3 &\ge v
\end{aligned}
\tag{7.3}
$$

Similarly, from (7.2) we obtain

$$
\begin{aligned}
-y_1 + 2y_2 + y_3 &\le v \\
y_1 - 2y_2 + 2y_3 &\le v \\
3y_1 + 4y_2 - 3y_3 &\le v
\end{aligned}
\tag{7.4}
$$

In addition we have

$$
\begin{aligned}
x_1 + x_2 + x_3 &= 1 \\
y_1 + y_2 + y_3 &= 1
\end{aligned}
\tag{7.5}
$$

and

$$
\begin{aligned}
x_i &\ge 0 \quad i = 1, 2, 3 \\
y_i &\ge 0 \quad i = 1, 2, 3
\end{aligned}
\tag{7.6}
$$

The equations and inequations (7.3) through (7.6) serve to determine
all seven of the unknowns $x_i$, $y_j$, and $v$.

We note first that altogether we have 14 relationships among 7
variables. It is clear that not all the $x_i$ and $y_j$ can be zero, but it is
possible for equality signs to hold in all the relationships of (7.3) and
(7.4). We first assume that this is so, and attempt to solve the following
set of eight equations in the seven unknowns:

$$
\begin{aligned}
-x_1 + x_2 + 3x_3 &= v \\
2x_1 - 2x_2 + 4x_3 &= v \\
x_1 + 2x_2 - 3x_3 &= v \\
-y_1 + 2y_2 + y_3 &= v \\
y_1 - 2y_2 + 2y_3 &= v \\
3y_1 + 4y_2 - 3y_3 &= v \\
x_1 + x_2 + x_3 &= 1 \\
y_1 + y_2 + y_3 &= 1
\end{aligned}
\tag{7.7}
$$

If these equations are found to be consistent, and have positive
solutions for the $x_i$ and $y_j$, we have a set of values satisfying all
the relationships (7.3) through (7.6). If not, we have to replace one
or more of the equality signs in (7.7) by strict inequalities until we
find a suitable set of non-negative solutions.

In this particular problem it is easy to verify that the eight equations
are consistent and have the solution

$x_1$     $x_2$     $x_3$     $y_1$     $y_2$     $y_3$     $v$
17/46     20/46     9/46      14/46     12/46     20/46     30/46


Consequently we have a solution to the game. Note that A now has
an expected gain per play of 30/46, whereas, if he were to stay with his
best pure strategy, he could only be sure of not losing more than 1.
B now has an expected loss of 30/46, whereas with a pure strategy
he cannot be sure he will lose less than 2.
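As a check on the arithmetic, the eight equations (7.7) can be solved exactly with rational numbers. The sketch below is not the book's procedure: it splits (7.7) into two 4-by-4 systems, one in $(x_1, x_2, x_3, v)$ and one in $(y_1, y_2, y_3, v)$, and uses a small Gauss-Jordan routine whose name `solve_linear_system` is hypothetical. The matrix is the one implied by the coefficients of (7.3) and (7.4).

```python
from fractions import Fraction


def solve_linear_system(coefficients, constants):
    """Solve a square, non-singular linear system exactly by Gauss-Jordan elimination."""
    n = len(coefficients)
    rows = [[Fraction(c) for c in row] + [Fraction(b)]
            for row, b in zip(coefficients, constants)]
    for col in range(n):
        # Find a row with a non-zero entry in this column and move it into place.
        pivot = next(r for r in range(col, n) if rows[r][col] != 0)
        rows[col], rows[pivot] = rows[pivot], rows[col]
        pivot_value = rows[col][col]
        rows[col] = [entry / pivot_value for entry in rows[col]]
        for r in range(n):
            if r != col and rows[r][col] != 0:
                factor = rows[r][col]
                rows[r] = [a - factor * b for a, b in zip(rows[r], rows[col])]
    return [row[-1] for row in rows]


# A's payoff matrix, as implied by the coefficients in (7.3) and (7.4).
PAYOFF = [
    [-1, 2, 1],
    [1, -2, 2],
    [3, 4, -3],
]

# Unknowns (x1, x2, x3, v): column j gives sum_i a_ij * x_i - v = 0, then x1 + x2 + x3 = 1.
x_equations = [[PAYOFF[i][j] for i in range(3)] + [-1] for j in range(3)]
x_equations.append([1, 1, 1, 0])
x1, x2, x3, v_from_x = solve_linear_system(x_equations, [0, 0, 0, 1])

# Unknowns (y1, y2, y3, v): row i gives sum_j a_ij * y_j - v = 0, then y1 + y2 + y3 = 1.
y_equations = [PAYOFF[i] + [-1] for i in range(3)]
y_equations.append([1, 1, 1, 0])
y1, y2, y3, v_from_y = solve_linear_system(y_equations, [0, 0, 0, 1])

print(x1, x2, x3, y1, y2, y3, v_from_x, v_from_y)
```

Both systems return the same value of the game, and all six probabilities are positive, so the trial with every relation taken as an equality succeeds here. `Fraction` prints values in lowest terms, so 20/46 appears as 10/23.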


Example 3 
Show that, for any zero-sum two-person game where the optimal
strategies are not pure strategies (i.e. there is no saddle point), and
for which A’s payoff matrix is

$$\begin{bmatrix} a_{11} & a_{12} \\ a_{21} & a_{22} \end{bmatrix}$$

the optimal strategies $(x_1, x_2)$ and $(y_1, y_2)$ are determined by

$$\frac{x_1}{x_2} = \frac{a_{22} - a_{21}}{a_{11} - a_{12}}, \qquad \frac{y_1}{y_2} = \frac{a_{22} - a_{12}}{a_{11} - a_{21}} \tag{7.8}$$

and the value $v$ of the game to A is

$$v = \frac{a_{11}a_{22} - a_{12}a_{21}}{a_{11} + a_{22} - (a_{12} + a_{21})} \tag{7.9}$$


SOLUTION
Since there is no saddle point, the largest and second largest elements
of the matrix must constitute one of the diagonals. This means that
there are only 8 possibilities (instead of 24) for the ordering of the
entries $a_{11}$, $a_{12}$, $a_{21}$, $a_{22}$. They are:

$$
\begin{aligned}
a_{11} > a_{22} > a_{12} > a_{21} \\
a_{11} > a_{22} > a_{21} > a_{12} \\
a_{22} > a_{11} > a_{12} > a_{21} \\
a_{22} > a_{11} > a_{21} > a_{12} \\
a_{12} > a_{21} > a_{11} > a_{22} \\
a_{12} > a_{21} > a_{22} > a_{11} \\
a_{21} > a_{12} > a_{11} > a_{22} \\
a_{21} > a_{12} > a_{22} > a_{11}
\end{aligned}
\tag{7.10}
$$


The six relations for determining optimal strategies are, regardless
of the ordering,

$$
\begin{aligned}
x_1 + x_2 &= 1 \\
y_1 + y_2 &= 1 \\
a_{11}x_1 + a_{21}x_2 &\ge v \\
a_{12}x_1 + a_{22}x_2 &\ge v \\
a_{11}y_1 + a_{12}y_2 &\le v \\
a_{21}y_1 + a_{22}y_2 &\le v
\end{aligned}
$$

The third and fourth restrictions, regarded as equalities, together yield

$$\frac{x_1}{x_2} = \frac{a_{22} - a_{21}}{a_{11} - a_{12}}$$

The right-hand side is positive for each of the eight orderings in
(7.10). Hence we may proceed to evaluate $x_1$ and $x_2$ separately, using
the first of the six relations, and obtain

$$x_1 = \frac{a_{22} - a_{21}}{a_{11} + a_{22} - (a_{12} + a_{21})}$$

$$x_2 = \frac{a_{11} - a_{12}}{a_{11} + a_{22} - (a_{12} + a_{21})}$$

We compute the value of the game to A by substituting these quantities
in either the third or fourth relation regarded as an equality. The
result is

$$v = \frac{a_{11}a_{22} - a_{12}a_{21}}{a_{11} + a_{22} - (a_{12} + a_{21})}$$

By using the fifth and sixth inequalities, we obtain

$$\frac{y_1}{y_2} = \frac{a_{22} - a_{12}}{a_{11} - a_{21}}$$

which again is positive for each of the eight orderings (7.10).


It should be pointed out that, if the matrix has a saddle point, equa- 
tions (7.8) and (7.9) may give an incorrect solution. 


Example 4 

In a game of matching coins with two players, suppose A wins one 
unit of value when there are two heads, wins nothing when there are 
two tails, and loses 1/2 unit of value when there are one head and one
tail. Determine the payoff matrix, the best strategies for each player, 
and the value of the game to A. 


SOLUTION
The payoff matrix (for A) is seen to be:

             B
           H       T
A    H    +1     -1/2
     T   -1/2      0


Since there is no saddle point, we know that the optimal strategies 
will be mixed strategies. The solution is obtained most easily by use 
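of the formulas given in Example 3. We have

$$\frac{x_1}{x_2} = \frac{0 - (-1/2)}{1 - (-1/2)} = \frac{1}{3}, \qquad \frac{y_1}{y_2} = \frac{0 - (-1/2)}{1 - (-1/2)} = \frac{1}{3}$$

$$v = \frac{(1)(0) - (-1/2)(-1/2)}{1 + 0 - (-1/2 - 1/2)} = -\frac{1}{8}$$

Thus each player should show heads 1/4 of the time and tails 3/4
of the time. The game is unfair to A, as he will lose on average 1/8
unit each time the game is played.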
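The formulas (7.8) and (7.9) of Example 3 translate directly into a small function. The sketch below uses exact fractions and applies the function to the coin-matching payoffs of this example; the name `solve_2x2` is hypothetical, and the function assumes that the matrix has no saddle point.

```python
from fractions import Fraction


def solve_2x2(a11, a12, a21, a22):
    """Apply (7.8) and (7.9); valid only when the 2-by-2 matrix has no saddle point."""
    a11, a12, a21, a22 = (Fraction(a) for a in (a11, a12, a21, a22))
    denominator = a11 + a22 - (a12 + a21)
    x = ((a22 - a21) / denominator, (a11 - a12) / denominator)  # A's mixed strategy
    y = ((a22 - a12) / denominator, (a11 - a21) / denominator)  # B's mixed strategy
    value = (a11 * a22 - a12 * a21) / denominator               # value of the game to A
    return x, y, value


half = Fraction(1, 2)
# Rows and columns are ordered (heads, tails), as in the payoff matrix of Example 4.
x, y, value = solve_2x2(1, -half, -half, 0)
print(x, y, value)
```

The result reproduces the strategy (1/4, 3/4) for both players and a value of -1/8 to A.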


Example 5 

In a campaign for mayor, the Whigs and Populists are each planning 
to nominate a candidate in closed session, with the results to be an- 
nounced simultaneously. Gamblers are expected to offer the following 
odds for the possible combinations of candidates: 


Whigs     Odds     Populists
Eise      2:1      Fogg
Eise      3:1      Kale
Eise      1:2      Smith
Nix       3:7      Fogg
Nix       3:2      Kale
Nix       1:3      Smith
Humph     3:1      Fogg
Humph     1:4      Kale
Humph     2:1      Smith


The political bosses wish to select candidates in accordance with 
standard minimax criteria. What are optimal strategies for the two 
parties? 


SOLUTION
If the payoff for a party were taken as its probability of winning,
the two payoff matrices in this “game” would be as follows:

                Populists                               Populists
              F      K      S                         F      K      S
        E   0.667  0.750  0.333                 E   0.333  0.250  0.667
Whigs   N   0.300  0.600  0.250         Whigs   N   0.700  0.400  0.750
        H   0.750  0.200  0.667                 H   0.250  0.800  0.333
        Whigs’ payoff matrix                    Populists’ payoff matrix


With these payoff matrices, the game is not zero-sum. The game
can be converted to zero-sum by taking the payoffs to be the differences
between the corresponding win probabilities. This change may be
justified, to some extent, by noting that the original payoffs and the
new payoffs increase and decrease together. The payoff matrix for the
Whigs becomes

                  Populists
               F        K        S
        E    0.333    0.500   -0.333
Whigs   N   -0.400    0.200   -0.500
        H    0.500   -0.600    0.333


The payoff matrix for the Populists is, of course, just the negative of 
that for the Whigs. 

To solve this game, we first check whether the optimal strategies 
are pure strategies. The largest row minimum is —0.333, and the 
smallest column maximum is +0.333. Hence the optimal strategies 
will be mixed strategies. 

The next step is to try to reduce the dimensionality of the game 
by the elimination of recessive pure strategies. A pair of pure strategies 
i and j are said to be in a dominant-recessive relationship if pure 
strategy i has a payoff greater than or equal to pure strategy j for 
each possible course of action of the opponent. If a pure strategy is 
recessive, it can be shown that its choice probability in the optimal 
mixed strategy is zero. Hence the rows (columns) corresponding to 
the various recessive pure strategies can be eliminated from the payoff 
matrix, in computing optimal strategies. 

For the Whigs, we find that Eise dominates Nix; and, for the
Populists, Smith dominates Fogg. Nix is nixed, Fogg is lifted, and the
Whigs’ payoff matrix becomes 2 by 2, viz.:

                Populists
               K        S
Whigs   E    0.500   -0.333
        H   -0.600    0.333

$$\frac{x_1}{x_2} = \frac{0.933}{0.833}, \qquad x_1 = 0.528, \quad x_2 = 0.472$$

$$\frac{y_1}{y_2} = \frac{0.666}{1.100}, \qquad y_1 = 0.377, \quad y_2 = 0.623$$

Both Whigs and Populists should use a random process in selecting
their candidate. The process used by the Whigs should have a 0.528
chance of selecting Eise and a 0.472 chance of selecting Humph. The
Populists should be 0.377 sure of picking Kale, and 0.623 sure of
backing Smith.

The value of the game to the Whigs is -0.019; so it seems that the
Populists have a slight edge. This supposition is strengthened by the
fact that, when the optimal mixed strategies are used, the Populists’
probability of winning the election is 0.510.
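The two steps used in this example, elimination of recessive strategies followed by the 2-by-2 formulas of Example 3, can be sketched as follows. The function names are hypothetical. Rows are the Whig candidates (Eise, Nix, Humph) and columns the Populist candidates (Fogg, Kale, Smith), numbered from zero, and the entries are the rounded differences of win probabilities given above.

```python
# Whigs' zero-sum payoff matrix: rows E, N, H; columns F, K, S.
WHIGS = [
    [0.333, 0.500, -0.333],
    [-0.400, 0.200, -0.500],
    [0.500, -0.600, 0.333],
]


def eliminate_recessive(matrix):
    """Repeatedly drop recessive rows (for A) and recessive columns (for B)."""
    rows = list(range(len(matrix)))
    columns = list(range(len(matrix[0])))
    while True:
        # A row is recessive if some other row is at least as good in every column.
        recessive_row = next(
            (r for r in rows
             if any(o != r and all(matrix[o][c] >= matrix[r][c] for c in columns)
                    for o in rows)),
            None,
        )
        if recessive_row is not None:
            rows.remove(recessive_row)
            continue
        # A column is recessive for B if some other column pays A no more in every row.
        recessive_column = next(
            (c for c in columns
             if any(o != c and all(matrix[r][o] <= matrix[r][c] for r in rows)
                    for o in columns)),
            None,
        )
        if recessive_column is not None:
            columns.remove(recessive_column)
            continue
        return rows, columns


def solve_2x2(a11, a12, a21, a22):
    """Formulas (7.8) and (7.9) of Example 3; assumes there is no saddle point."""
    d = a11 + a22 - (a12 + a21)
    x = ((a22 - a21) / d, (a11 - a12) / d)
    y = ((a22 - a12) / d, (a11 - a21) / d)
    return x, y, (a11 * a22 - a12 * a21) / d


rows, columns = eliminate_recessive(WHIGS)
(a11, a12), (a21, a22) = [[WHIGS[r][c] for c in columns] for r in rows]
x, y, value = solve_2x2(a11, a12, a21, a22)
print(rows, columns, x, y, value)
```

The surviving rows are Eise and Humph and the surviving columns are Kale and Smith, in agreement with the reduction made in the text.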


Example 6 


In the following 3-by-3 game we wish to find optimal strategies and
the value of the game.

            B
          1    2    3
     1    3   -2    4
A    2   -1    4    2
     3    2    2    6


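SOLUTION
As in Example 2, we arrive at the following relationships:

$$
\begin{aligned}
3x_1 - x_2 + 2x_3 &\ge v \\
-2x_1 + 4x_2 + 2x_3 &\ge v \\
4x_1 + 2x_2 + 6x_3 &\ge v \\
3y_1 - 2y_2 + 4y_3 &\le v \\
-y_1 + 4y_2 + 2y_3 &\le v \\
2y_1 + 2y_2 + 6y_3 &\le v \\
x_1 + x_2 + x_3 &= 1 \\
y_1 + y_2 + y_3 &= 1 \\
x_i &\ge 0 \quad i = 1, 2, 3 \\
y_j &\ge 0 \quad j = 1, 2, 3
\end{aligned}
\tag{7.12}
$$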


We attempt to find a solution by taking any four of the first six
relations as equalities, together with the seventh and eighth, and
solving these six equations for the $x_i$ and $y_j$ in terms of $v$. If these
equations are consistent and if a value of $v$ can be found that results in
positive values for the $x_i$ and $y_j$, and if these latter values also satisfy
the two remaining unused relations, then we have a solution to the game.*

For example, if we took the first four relations as equalities, we would
obtain the system

$$
\begin{aligned}
3x_1 - x_2 + 2x_3 &= v \\
-2x_1 + 4x_2 + 2x_3 &= v \\
4x_1 + 2x_2 + 6x_3 &= v \\
3y_1 - 2y_2 + 4y_3 &= v \\
x_1 + x_2 + x_3 &= 1 \\
y_1 + y_2 + y_3 &= 1
\end{aligned}
\tag{7.13}
$$

The first three equations imply $x_1 = x_2 = -x_3$, $v = 0$; since all $x_i$
are non-negative, we have $x_1 = x_2 = x_3 = 0$. However, these values
do not satisfy the fifth equation; and so this trial set of six equations
cannot furnish a solution to the game. We then try another set of
equations, and hope for a legitimate solution. Continuing in this way, we
will eventually try the set of equations

$$
\begin{aligned}
3x_1 - x_2 + 2x_3 &= v \\
-2x_1 + 4x_2 + 2x_3 &= v \\
-y_1 + 4y_2 + 2y_3 &= v \\
2y_1 + 2y_2 + 6y_3 &= v \\
x_1 + x_2 + x_3 &= 1 \\
y_1 + y_2 + y_3 &= 1
\end{aligned}
\tag{7.14}
$$


We find that these six equations in seven unknowns are consistent,
and have the solution $X = (0, 0, 1)$; $Y = (2/5, 3/5, 0)$; $v = 2$. This
proposed solution is seen, by substitution, to satisfy all the
requirements (7.12), and hence is an actual solution to the game. (There
are other solutions as well; see problem 3 in the list at the end of this
chapter.)
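The substitution check mentioned above is easy to mechanize. The sketch below tests a proposed triple (X, Y, v) against every relation in (7.12); the function name `satisfies_game_conditions` is hypothetical, and the fractions in Y are the values that follow from solving (7.14).

```python
from fractions import Fraction

# A's payoff matrix for Example 6, read off from the coefficients in (7.12).
PAYOFF = [
    [3, -2, 4],
    [-1, 4, 2],
    [2, 2, 6],
]


def satisfies_game_conditions(matrix, x, y, v):
    """Check the probability conditions and both sets of inequalities in (7.12)."""
    m, n = len(matrix), len(matrix[0])
    probabilities_ok = (
        sum(x) == 1 and sum(y) == 1 and all(p >= 0 for p in list(x) + list(y))
    )
    # Against every pure strategy of B, A's expectation must be at least v.
    a_ok = all(sum(matrix[i][j] * x[i] for i in range(m)) >= v for j in range(n))
    # Against every pure strategy of A, B must concede at most v.
    b_ok = all(sum(matrix[i][j] * y[j] for j in range(n)) <= v for i in range(m))
    return probabilities_ok and a_ok and b_ok


X = (0, 0, 1)
Y = (Fraction(2, 5), Fraction(3, 5), 0)
print(satisfies_game_conditions(PAYOFF, X, Y, 2))
```

A triple that fails any one relation is rejected, which is how the unsuccessful trial sets of equations are ruled out.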

* The careful reader may have noted that the dimension of this particular game 
could have been reduced by the elimination of B’s strategy 3, which is dominated 
by strategy 1. The game is worked out as a 3-by-3 game in order to illustrate the 
algebraic techniques involved in the general case. 




Before proceeding to a quick method for approximating the solution 
of a game, we summarize the analytical procedure: 


1. Look for a saddle point. 

2. Look for dominance, and eliminate recessive strategies. 

3. If the elimination of recessive strategies yields a 2-by-2 matrix, 
use the formula of Example 3. If it yields a 2-by-m or an m-by-2 
matrix, a graphical modification of the algebraic method as explained 
in problem 2 of Set IX is probably quickest. If it yields a matrix 
whose small dimension exceeds 2, solve by the algebraic method. 


Approximate Solution of Two-Person Zero-Sum Games 


Games with large matrices are extremely tedious to solve by standard 
methods. An approximate method exists which may be used to find 
the value of the game to any desired accuracy. Optimal strategies can
also be approximated. The basic idea in solving the game in this 
manner is that each player acts under the assumption that the past is 
the best guide to the future and will play in such a manner as to 
maximize his expectation. The method, which will work for any size 
matrix, is best illustrated by an example. 


Example 7 
The matrix of a certain two-person zero-sum 3-by-3 game is given
below. Show how to obtain an approximate solution by iteration.

            B
          1    2    3
     1    1    0    2
A    2    3    0    0
     3    0    2    1

SOLUTION

Player A arbitrarily selects any row and places it under the matrix.
Here we shall arbitrarily select row 1. Player B examines this row and
chooses a column corresponding to the smallest number in the row.
This is column 2. Column 2 is then placed to the right of the matrix.
Player A examines this column and chooses a row corresponding to the
largest number in this column. This is row 3. Row 3 is then added to
the row last chosen and the sum of the two rows is placed beneath the
row last chosen. Player B chooses a column corresponding to the
smallest number in the new row and adds this column to the column
last chosen. In case of a tie that row or column must be chosen by
some consistent mechanism. A coin flip will do, but in this example
we shall arbitrarily say that in case of a tie the player will select the 
row or column different from his last choice. This procedure may be 
continued in the same manner. Ten iterations are shown below with 
the smallest numbers in each succeeding row and largest numbers in 
each succeeding column circled.

[Table of the ten iterations not recoverable from the source.]
* Choice made by tie-breaking rules given above.

The approximate strategies after 10
iterations are found by dividing the number of circled numbers in
each row or column by the total number of circled numbers. Thus
A’s approximate strategy is (7'p, 1°, 1'0) and B’s approximate strategy 


is (3%, 10, 10). 


Upper and lower bounds for the value of the game can be determined
by dividing the highest number in the last column, 11, by the total
number of iterations, 10, and by dividing the lowest number in the
last row by the number of iterations, 10. Thus

$$\frac{\text{lowest number in last row}}{10} \le v \le \frac{11}{10}$$

Sometimes narrower bounds can be determined by examining the
restrictions given by all the rows and columns. For example, the third
last row indicates a lower bound for $v$ of 1. This is a greater
lower bound than the one given by the last row. And the sixth column
indicates a lower upper bound, 1. Thus

$$1 \le v \le 1$$


In this case we have determined the value of the game exactly as 
v = 1. In general, we do not get exact convergence in this manner 
but only a continual narrowing of the upper and lower bounds. 

The exact solution of this game is v = 1 with optimal strategy for
player A, (1/4, 1/4, 1/2) and optimal strategy for player B, (1/3, 1/3, 1/3). The
approximate solutions are not too far off, and would get better with 
more iterations. If there is more than one solution, the approximate 
strategy may not converge on any given solution. However, at any 
point the approximate strategy will approximate one of the several 
solutions of the game. 


The iterative method is very powerful for handling large games. It 
is the kind of method that is easily adaptable to machine computations. 
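The iteration lends itself to a short program. The sketch below is one possible reading of the procedure, not a transcription of the worked table: the 3-by-3 matrix is an assumption chosen for illustration (its value is 1), the function names are hypothetical, and ties are broken by taking the lowest-numbered tied choice that differs from the player's last choice.

```python
# Assumed payoff matrix for A, used only for illustration; its value is 1.
ASSUMED_MATRIX = [
    [1, 0, 2],
    [3, 0, 0],
    [0, 2, 1],
]


def pick(totals, best, last_choice):
    """Index of the best total; on a tie, avoid the player's last choice if possible."""
    target = best(totals)
    tied = [k for k, total in enumerate(totals) if total == target]
    if len(tied) > 1 and last_choice in tied:
        tied.remove(last_choice)
    return tied[0]


def iterate_game(matrix, iterations, first_row=0):
    """Return approximate strategies for A and B and bounds on the value."""
    m, n = len(matrix), len(matrix[0])
    row_totals = [0] * n       # running sum of the rows A has chosen
    column_totals = [0] * m    # running sum of the columns B has chosen
    row_counts, column_counts = [0] * m, [0] * n
    lower, upper = float("-inf"), float("inf")
    row, column = first_row, None
    for k in range(1, iterations + 1):
        row_counts[row] += 1
        row_totals = [t + a for t, a in zip(row_totals, matrix[row])]
        column = pick(row_totals, min, column)        # B minimizes the summed row
        column_counts[column] += 1
        column_totals = [t + matrix[i][column] for i, t in enumerate(column_totals)]
        lower = max(lower, min(row_totals) / k)       # best lower bound seen so far
        upper = min(upper, max(column_totals) / k)    # best upper bound seen so far
        row = pick(column_totals, max, row)           # A maximizes the summed column
    x = [count / iterations for count in row_counts]
    y = [count / iterations for count in column_counts]
    return x, y, lower, upper


x, y, lower, upper = iterate_game(ASSUMED_MATRIX, 10)
print(x, y, lower, upper)
```

Because the bounds are taken over all rows and columns generated so far, they can only tighten as the number of iterations grows. A different tie-breaking rule may give different approximate strategies after the same number of iterations.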


OTHER COMPETITIVE SITUATIONS 


In this section we consider some competitive situations which are 
not competitive games in the strict sense of the previous section, but 
which are amenable to simple analysis. The first two examples show 
how to determine optimal bidding strategies in two common types of 
bidding situations. The third example deals with a poker hand; and
the last two discuss a very idealized competitive marketing situation.

The reader will note that the criterion used in this section is maximum
gain rather than minimax (or, more correctly, maximin) gain.
This was selected not because one criterion is considered superior to 
the other but because the application of minimax methods to problems 
where the choice variable is continuous usually involves techniques 
more advanced than those given in this book. 


Example 8 
A manufacturer engaged in bidding for contracts makes it a practice
to record, in addition to his own submitted bid, his cost estimate K
and the lowest opponent bid B on each job. He finds that the random
variable x, where

$$x = \frac{B - K}{K}$$

follows a normal distribution $f(x)$ with mean 0.15 and standard
deviation 0.05.

If the manufacturer is interested in maximizing his expected profit 
on each job, and if it is assumed that his competitors’ bidding habits 
will not change, what should the manufacturer’s bidding strategy be? 


SOLUTION
Let A be the manufacturer’s bid, and B be the lowest opponent’s
bid. Then his profit P will be

$$P = \begin{cases} A - K & \text{if } A < B \\ 0 & \text{if } A > B \end{cases}$$

Let $E(P; A)$ denote the expected profit with bid A. Then

$$E(P; A) = (A - K)\, P\{B > A\}$$

We replace the decision variable A by the equivalent variable z, where

$$z = \frac{A - K}{K}$$

Then

$$E(P; A) = K z\, P\{x > z\} = K z \int_z^{\infty} f(x)\, dx$$

where x is the random variable in the statement of the problem, and
$f(x)$ is its density function.
The quantity

$$z \int_z^{\infty} f(x)\, dx$$

is maximized by the solution z of the equation

$$\int_z^{\infty} f(x)\, dx - z f(z) = 0 \tag{7.15}$$


Graphically, $E(P; z)$ is a maximum when the two cross-hatched areas
in Figure 7.1 are equal. 

Numerically, z may be found by using tables of ordinates and areas 
of the normal distribution. It turns out that z = 0.117 is a solution 
to the preceding equation. Thus the manufacturer's optimum bidding 
strategy is to add 11.7% to his own cost estimate in submitting a bid.

A critical assumption in the statement of the problem is that the
competitors’ bidding habits will not change. It will be realized that
any of the earlier examples of this chapter could have been “solved”
by making a similar assumption about an opponent’s strategy. However,
when there is only one opponent, it seems more plausible to
assume that he will behave rationally, i.e. that he will use to his
advantage the knowledge that accrues to him concerning our strategy.
In a bidding situation, the competition is from an amorphous collection
of opponents, and it is more reasonable to suppose (at least in the short
run) that their actions viewed as a group will not change.

Figure 7.1. Graphical solution of equation (7.15).
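Equation (7.15) can also be solved numerically instead of with tables. The sketch below uses bisection and the standard library's error function; the function names and the search interval from 0 to 0.3 are assumptions made for illustration (the left side of (7.15) is positive at 0 and negative at 0.3 for this distribution).

```python
import math

MEAN, STD = 0.15, 0.05   # distribution of x = (B - K) / K in Example 8


def density(x):
    """Normal density f(x) with the stated mean and standard deviation."""
    return math.exp(-0.5 * ((x - MEAN) / STD) ** 2) / (STD * math.sqrt(2 * math.pi))


def upper_tail(z):
    """P{x > z}, the integral of f from z to infinity."""
    return 0.5 * math.erfc((z - MEAN) / (STD * math.sqrt(2)))


def condition(z):
    """Left side of equation (7.15)."""
    return upper_tail(z) - z * density(z)


def solve_markup(low=0.0, high=0.3, tolerance=1e-9):
    """Bisection on (7.15); assumes condition(low) > 0 > condition(high)."""
    while high - low > tolerance:
        middle = (low + high) / 2
        if condition(middle) > 0:
            low = middle
        else:
            high = middle
    return (low + high) / 2


markup = solve_markup()
print(round(markup, 3))
```

The root agrees with the tabular value quoted in the text to three decimal places.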


Example 9 

Two objects of value $100 and $130 are to be auctioned at a public 
sale. Only two bidders are interested in these items. Bidder A has 
$100 available, and bidder B has $80 available. What should be their 
strategies if each bidder is interested in maximizing his own gain? 


SOLUTION

Let us assume for simplicity that successive bids increase in steps
of $\Delta$ dollars. At any stage of the bidding, a player has the option of
letting his opponent’s bid stand, or of increasing it by $\Delta$ dollars and
leaving the next decision up to his opponent.

Suppose that bidder B has just bid $x$ dollars for the first object.
Let $\alpha(x, 0)$ be A’s gain if he lets B’s bid stand, and $\alpha(x, \Delta)$ be A’s gain
if he increases the bid to $x + \Delta$ and B lets him win. If A lets B take
the first object for $x$, then A is sure to win the second object with a
bid of

$$80 - x + \Delta$$

since B’s total capital is $80. Thus

$$\alpha(x, 0) = 130 - [80 - x + \Delta] = 50 + x - \Delta$$

If A wins with a bid of $x + \Delta$, then B will win the second object (if
$x > 20$) with a bid of

$$100 - (x + \Delta) + \Delta = 100 - x$$

since A’s total capital is $100. Thus

$$\alpha(x, \Delta) = 100 - (x + \Delta) = 100 - x - \Delta$$

We see that A should bid $x + \Delta$ provided $100 - x - \Delta > 50 + x - \Delta$:
i.e. provided $x < 25$. If $x > 25$, A should let B win the first
object for that amount.

Player B has a similar decision to make. If A has just bid $x$ (and
if $x > 20$) then B’s gains $\beta(x, 0)$ and $\beta(x, \Delta)$ are given by

$$\beta(x, 0) = 130 - (100 - x + \Delta) = 30 + x - \Delta$$
$$\beta(x, \Delta) = 100 - (x + \Delta) = 100 - x - \Delta$$

So B should bid $x + \Delta$ on item 1 provided $x < 35$.

It is clear that B will take the less expensive object for about $25, 
and A will take the more expensive object for about $55. Both parties 
end up with the same net gain; and if the other item were offered first, 
the results would be exactly the same. The order of the items is im- 
material in all such problems, regardless of the relative values of the 
two items and the relative resources of the competitors. The net gains
will be equal when conditions are such that the optimal strategies lead 
to a division of the two items between the competitors; otherwise, of 
course, the net gain for one of the parties will be zero. 
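The decision rule for each bidder follows from comparing two gains, and can be sketched as below. The function names are hypothetical, and the step of one dollar is an assumption made for illustration. The formulas hold only while the opponent can still afford the second object, as noted in the text.

```python
def gain_if_pass(x, step, second_value, opponent_capital):
    """Let the opponent take item 1 at x, then buy item 2 just above his remaining capital."""
    return second_value - (opponent_capital - x + step)


def gain_if_raise(x, step, first_value):
    """Raise to x + step on item 1 and win it at that price."""
    return first_value - (x + step)


def should_raise(x, step, first_value, second_value, opponent_capital):
    """Raise only if it gives a strictly larger gain than passing."""
    return (gain_if_raise(x, step, first_value)
            > gain_if_pass(x, step, second_value, opponent_capital))


def raise_threshold(first_value, second_value, opponent_capital):
    """Bid level below which raising is better; the step size cancels out."""
    return (first_value - second_value + opponent_capital) / 2


# Values from Example 9: items worth $100 and $130; A holds $100 and B holds $80.
print(raise_threshold(100, 130, 80))    # A's threshold, facing B's capital
print(raise_threshold(100, 130, 100))   # B's threshold, facing A's capital
```

The two thresholds reproduce the conditions x < 25 for A and x < 35 for B.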


Example 10 
A and B are playing a variety of poker in which all the cards dealt
are face up. After four cards have been dealt to each person the hands
are as follows:

A: ace of clubs, ace of diamonds, ace of hearts, deuce of hearts.
B: four of hearts, five of hearts, six of hearts, seven of hearts.

At this point there is $1.00 in the pot and A bets $1.00. One more
card is to be dealt to each player. What should B do? At what size
bet by A would B be wise to drop?


SOLUTION
The rationale of poker is to bet only if one’s expectation of the
final pot is greater than or equal to the amount one must put in to
reach the final pot. In this problem, all the cards can be seen, and the
expectations can be calculated exactly. In a general poker problem
cards are hidden and expectations must be estimated. Potential
improvements in the two hands by the fifth card are:

A: Four of a kind: 1 chance. Full house: 3 chances.
B: Straight flush: 2 chances. Straight: 6 chances. Flush: 5 chances.

The probability of B’s winning is the probability of a straight flush,
plus the probability of a flush or straight times the probability A does
not improve:

$$\Pr\{\text{B wins}\} = \frac{2}{44} + \frac{11}{44} \cdot \frac{40}{44} = 0.273$$


If B meets the raise and calls, he will put $1.00 in the pot. There will
be $3.00 in the pot and his expectation will be

0.273 × $3.00 = $0.82

This indicates that B’s calling of A’s bet would be a bad investment.
Suppose that A had raised the amount $x$. Then there would be
$1 + 2x$ dollars in the pot if B called. He should call if

$$0.273\,(1 + 2x) \ge x$$
$$0.60 \ge x$$


As long as A does not bet more than 60 cents, B should meet the raise 
and call. Otherwise B should drop out. 
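The poker calculation can be checked with exact fractions. The sketch below assumes, as the figure 0.273 suggests, that the chance of A failing to improve is taken as 40 out of the same 44 unseen cards; the variable and function names are hypothetical.

```python
from fractions import Fraction

UNSEEN = 52 - 8   # cards not yet dealt face up

p_straight_flush = Fraction(2, UNSEEN)        # three or eight of hearts
p_flush_or_straight = Fraction(5 + 6, UNSEEN)
# Assumption: A's four improving cards are counted against the same 44 unseen cards.
p_a_does_not_improve = Fraction(UNSEEN - 4, UNSEEN)

p_b_wins = p_straight_flush + p_flush_or_straight * p_a_does_not_improve


def call_expectation(bet, pot=1):
    """B's expected share of the final pot if he calls a bet of the given size."""
    return p_b_wins * (pot + 2 * bet)


# Largest bet B should call: p * (1 + 2x) >= x gives x <= p / (1 - 2p).
max_bet = p_b_wins / (1 - 2 * p_b_wins)

print(float(p_b_wins), float(call_expectation(1)), float(max_bet))
```

With these counts the probability is exactly 3/11, the expectation when calling a $1.00 bet is 9/11 of a dollar, and the break-even bet is 60 cents.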


Example 11 

Two competitors A and B dominate an industry, and have fixed 
amounts X and Y, respectively, available for advertising and sales 
promotion. The country has been divided into n sales districts, which 
are the same for both companies. In district $i$ the industry sales
potential is $s_i$, and this sales potential is actually achieved in each area
regardless of the total amount of sales promotion and advertising 
devoted to that area. Assuming that the industry sales in each district 
are divided between the two competitors in proportion to the amount 
spent on advertising and promotion within the district, how should 
the competitors divide their promotion funds among the n districts? 


SOLUTION
Let $x_i$ = amount spent by A in $i$th area
    $y_i$ = amount spent by B in $i$th area

Then the total amount of A’s sales will be

$$\sum_{i=1}^{n} \left( \frac{x_i}{x_i + y_i} \right) s_i$$

and the total amount of B’s sales will be

$$\sum_{i=1}^{n} \left( \frac{y_i}{x_i + y_i} \right) s_i$$

Since the total sales $S$ ($= \sum s_i$) is a constant, A will maximize his total
sales if and only if he maximizes the difference between his sales and
B’s sales. That is, A wishes to maximize the quantity $D$, where

$$D = \sum_{i=1}^{n} \left( \frac{x_i - y_i}{x_i + y_i} \right) s_i \tag{7.16}$$

This can be regarded as a zero-sum game. A chooses a set of positive
$x_i$ adding up to $X$; B chooses a set of positive $y_i$ adding up to $Y$. A’s
payoff is $D(x_1, \dots, x_n, y_1, \dots, y_n)$; B’s payoff is $-D(x_1, \dots, x_n, y_1, \dots, y_n)$.
The simple methods of the previous section, however, do not suffice
to solve this game, since the competitors each have a non-denumerably
infinite number of courses of action.

Unfortunately, a motivated discussion of the solution of continuous 
games cannot be presented within the confines of an example. We 
shall, however, state the solution and prove that it comprises the op- 
timal strategy for each competitor. The solution is 


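$$x_i = \frac{s_i}{S} X \quad \text{for all } i \tag{7.17}$$

$$y_i = \frac{s_i}{S} Y \quad \text{for all } i \tag{7.18}$$

and with these values for $x_i$ and $y_i$

$$D = \frac{X - Y}{X + Y}\, S \tag{7.19}$$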
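To prove that this is a solution, suppose that A adopts the strategy
(7.17), and that B deviates slightly from (7.18), viz.:

$$y_i = \frac{s_i}{S} Y + \Delta_i \tag{7.20}$$

where each $\Delta_i$ is small and $\sum \Delta_i = 0$. The payoff $D'$ to A is now
expressed by

$$D' = \sum_{i=1}^{n} s_i \left[ \frac{s_i (X - Y) - S\Delta_i}{s_i (X + Y) + S\Delta_i} \right] \tag{7.21}$$

We note that each term in the sum has the form

$$k\, \frac{a - \epsilon}{b + \epsilon}$$

with $\epsilon$ small. By elementary expansions, we easily obtain the
approximate result

$$k\, \frac{a - \epsilon}{b + \epsilon} \approx \frac{ka}{b} \left[ 1 - \left( \frac{\epsilon}{a} + \frac{\epsilon}{b} \right) + \left( \frac{\epsilon^2}{ab} + \frac{\epsilon^2}{b^2} \right) \right]$$

Applying this to (7.21), with $k = s_i$, $a = s_i(X - Y)$, $b = s_i(X + Y)$,
$\epsilon = S\Delta_i$, the result is

$$D' = \sum_{i} s_i \frac{X - Y}{X + Y} \left[ 1 - \frac{S\Delta_i}{s_i(X - Y)} - \frac{S\Delta_i}{s_i(X + Y)} + \frac{S^2\Delta_i^2}{s_i^2(X^2 - Y^2)} + \frac{S^2\Delta_i^2}{s_i^2(X + Y)^2} \right]$$

$$= \frac{X - Y}{X + Y} \left[ S - S \left( \frac{1}{X - Y} + \frac{1}{X + Y} \right) \sum \Delta_i + S^2 \left( \frac{1}{X^2 - Y^2} + \frac{1}{(X + Y)^2} \right) \sum \frac{\Delta_i^2}{s_i} \right]$$

Since $[(X - Y)/(X + Y)]S = D$, and $\sum \Delta_i = 0$, we may write $D'$ as

$$D' = D + \left( \frac{S}{X + Y} \right)^2 \left[ 1 + \frac{X - Y}{X + Y} \right] \left[ \sum \frac{\Delta_i^2}{s_i} \right]$$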


Note that each term in brackets is intrinsically positive. This implies 
that D' > D; we conclude that the strategy (7.17) used by A is optimal. 
In a similar fashion, we can show that (7.18) is the optimal strategy 
for B. 


Example 12 

The situation is the same as in the previous example, with the 
exception that B’s allocations to districts are arrived at by executive 
fiat, and are not the optimal allocations as indicated in the last example. 
If A knows B’s intended allocations, how should A react to take ad- 
vantage of the fact that B’s allocations are non-optimal? 
SOLUTION
The function to be maximized with respect to the x_i is

    D = \sum_{i=1}^{n} s_i \frac{x_i - y_i}{x_i + y_i}        (y_1, ..., y_n parameters)

subject to the restriction \sum_i x_i = X. Here the Lagrangian multiplier
technique can be used. Form the functional

    F = \sum_{i} s_i \frac{x_i - y_i}{x_i + y_i} - \lambda \left( \sum_{i} x_i - X \right)
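Differentiating F with respect to each of the x_i and \lambda, we find

    \frac{2 s_1 y_1}{(x_1 + y_1)^2} - \lambda = 0

    \frac{2 s_2 y_2}{(x_2 + y_2)^2} - \lambda = 0

    ...

    \frac{2 s_n y_n}{(x_n + y_n)^2} - \lambda = 0

    \sum_{i} x_i - X = 0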


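A solution for \lambda is obtained by solving each of the first n equations for
the term x_i + y_i, and then adding. The result is

    \lambda = 2 \left( \frac{\sum_i \sqrt{s_i y_i}}{X + Y} \right)^2

Thus, since the y_i are parameters, we have

    x_i = \frac{(X + Y) \sqrt{s_i y_i}}{\sum_j \sqrt{s_j y_j}} - y_i,        i = 1, ..., n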
To illustrate this result, let s_1 = 50, s_2 = 100, s_3 = 70, X = Y = 30.
Suppose y_1 = 8, y_2 = 15, y_3 = 7. The analysis is shown in Table 1.

TABLE 1

                                           Minimax value
  s_i     y_i     \sqrt{s_i y_i}     x_i     of x_i and y_i

   50      8          20.0           6.8          6.8
  100     15          38.8          13.8         13.6
   70      7          22.2           9.4          9.6

Notice that the x_i's in the fourth column do not deviate very much
from the minimax strategy as given by the previous example.
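
The calculation behind Table 1 can be sketched in a few lines of Python. This is only an illustration of the closed-form result above: the function names `best_response` and `payoff` are ours, and the formula is assumed to return positive allocations, as it does for these data.

```python
from math import sqrt

def best_response(s, y, X):
    """A's allocations x_i when B's allocations y_i are known (Lagrangian result)."""
    Y = sum(y)
    roots = [sqrt(si * yi) for si, yi in zip(s, y)]
    total = sum(roots)
    return [(X + Y) * r / total - yi for r, yi in zip(roots, y)]

def payoff(s, x, y):
    """The quantity D: sum of s_i (x_i - y_i) / (x_i + y_i)."""
    return sum(si * (xi - yi) / (xi + yi) for si, xi, yi in zip(s, x, y))

s = [50, 100, 70]          # district sales potentials
y = [8, 15, 7]             # B's allocations, fixed by executive fiat
X = 30                     # A's total budget

x = best_response(s, y, X)
minimax = [si / sum(s) * X for si in s]   # strategy (7.17)

print([round(v, 2) for v in x])
print(round(payoff(s, x, y), 4), round(payoff(s, minimax, y), 4))
```

The best response adds up to X and gives A a slightly larger D than the minimax allocation does, which is the point of the example.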
PROBLEMS FOR SOLUTION 
Set IX 


1. The following games have saddle-point solutions. Determine the optimal 
minimax strategies for each player. 


(a)    6    8    6
       4   12    2
al PP er 
(b) 6-8-4 —_§ 
402 -3 
2. Find the solution of the following game:

       1    3   11
       8    5    2

[Hint: This problem may be worked by the algebraic methods of the text.
However, the following graphical method, which is applicable to any m-by-2
or 2-by-m game, may give quicker results. The restrictions on the x_i may be
rewritten as

    R_1:    7x_1 + v ≤ 8,
    R_2:    2x_1 + v ≤ 5,
    R_3:   -9x_1 + v ≤ 2.

Plotting the relaxed inequalities on a graph with x_1 as abscissa and v as ordinate
gives a diagram such as Figure 7.2. The pair (v, x_1) in the final solution of the
game must be below all three lines, and yet have as great a v as possible.
(Why?) Clearly this pair is the point of intersection of R_2 and R_3, which is
x_1 = 3/11, v = 49/11. So the value of the game is 49/11, and A’s optimal
strategy is (3/11, 8/11). The final step is the calculation of B’s optimal
strategy, which is easily done algebraically once v is known.]
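
The graphical argument of the hint can be imitated numerically. The sketch below assumes the payoff matrix implied by the three restrictions (rows 1, 3, 11 and 8, 5, 2); the function name `solve_2_by_n` is ours. It finds the highest point of the lower envelope of the lines by checking the end points and every pairwise intersection.

```python
from fractions import Fraction
from itertools import combinations

def solve_2_by_n(row1, row2):
    """Return (x1, v): A's probability of playing row 1 and the value of the game."""
    # Against column j, A's expected payoff is (a - b) * x1 + b.
    lines = [(Fraction(a - b), Fraction(b)) for a, b in zip(row1, row2)]

    def envelope(x):
        return min(slope * x + intercept for slope, intercept in lines)

    candidates = {Fraction(0), Fraction(1)}
    for (m1, c1), (m2, c2) in combinations(lines, 2):
        if m1 != m2:
            x = (c2 - c1) / (m1 - m2)
            if 0 <= x <= 1:
                candidates.add(x)
    best_x = max(candidates, key=envelope)
    return best_x, envelope(best_x)

x1, v = solve_2_by_n([1, 3, 11], [8, 5, 2])
print(x1, v)
```

Exact fractions are used so that the result can be compared directly with the answer quoted in the hint.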
Figure 7.2. Graphical solution of a 2-by-m game.


3. Rework Example 6 of the text, taking advantage of the existing dominance
relationship.

[Hint: After eliminating B’s strategy 3, we have a 3-by-2 game. Apply the
graphical method of the previous problem to obtain B’s optimal strategy,
remembering that the payoff matrix for B is the negative of that for A.]

[Ans. A: (0, 0, 1); B: (a, 1 - a, 0) where a is an arbitrary number
between 2/5 and 4/5; v = 2.]


4. A and B play a game in which each has three coins: a penny, a nickel, and
a dime. Each selects a coin without knowledge of the other's choice. If the
sum of the coins is an odd amount, A wins B’s coin; if the sum is even, B
wins A’s coin. Find the best strategy for each player and the value of the
game. [Ans. A: (1/2, 0, 1/2); B: (10/11, 0, 1/11); v = 0.]


5. Two items of value $80 and $100 are to be bid for simultaneously (with 
sealed bids) by A and B. Both bidders announce their intention of devoting 
a total of $110 to the two bids. If each uses a minimax criterion, what are the 
resulting bids? 

[Ans. $65 for the more valuable item, $45 for the less valuable.]

[Hint: Let x be A’s bid and y be B’s bid for the more valuable item. If
A wins with a bid of x, he does not win the second item, and his gain is 100 - x.
If A does not win with a bid of x, he wins the second item with a resultant
gain of x - 30. For a given x, let R(x) be the smaller of 100 - x and x - 30.
Then R(x) is maximized by a value of x equal to 65.]


6. If we build an m-by-n matrix by drawing each element at random from
a continuous distribution, what is the probability that it has a saddle point?

[Hint: Since the distribution is continuous, the matrix can have at most
one saddle point. This means that saddle points in various positions of the
matrix are mutually exclusive events, and hence that the probabilities of these
events are additive. Calculate the probability that the upper left-hand corner
element is a saddle point, and multiply by mn.]

[Ans. m! n! / (m + n - 1)!]
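
For small matrices the answer can be checked by brute force. Since the distribution is continuous, every ranking of the mn elements is equally likely, so it suffices to enumerate all orderings. The sketch below does this; the function names are ours, and a saddle point is taken to be an element that is the minimum of its row and the maximum of its column.

```python
from fractions import Fraction
from itertools import permutations
from math import factorial

def has_saddle_point(matrix):
    col_max = [max(col) for col in zip(*matrix)]
    return any(value == min(row) and value == col_max[j]
               for row in matrix for j, value in enumerate(row))

def saddle_probability(m, n):
    """Exact probability by enumerating all rankings of the m*n elements."""
    count = total = 0
    for perm in permutations(range(m * n)):
        matrix = [perm[i * n:(i + 1) * n] for i in range(m)]
        total += 1
        count += has_saddle_point(matrix)
    return Fraction(count, total)

def formula(m, n):
    return Fraction(factorial(m) * factorial(n), factorial(m + n - 1))

print(saddle_probability(2, 2), formula(2, 2))
print(saddle_probability(2, 3), formula(2, 3))
```

The enumeration grows as (mn)!, so it is practical only for very small m and n.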
CHAPTER


Allocation 


Problems of allocation arise whenever there are a number of activities 
to perform, but limitations on either the amount of resources or the 
way they can be spent prevent us from performing each separate ac- 
tivity in the most effective way conceivable. In such situations we 
wish to allot the available resources to the activities in a way that will 
optimize the total effectiveness. 

The number of possible ways of allocating the resources to the ac- 
tivities can be finite or infinite. In problems with a finite number of 
choices we could, in theory, enumerate all possible choices. Unfortu- 
nately, in practical problems enumeration is far too lengthy; for 
example, there are 10! ways of allocating 10 contracts one apiece to 10 
contractors. In the case of continuous choice (e.g., division of capital 
between possible investments), the restrictions on the choice variables 
are often of such a nature as to preclude the use of calculus. 

It is only in recent years that mathematicians have realized that, 
for many practical purposes, solutions in principle (e.g. enumeration) 
are insufficient. Naturally enough, the first situations to be discussed 
were the comparatively simple ones where the effectiveness and the 
restrictions are stated in terms of linear functions of the allocations. 
The analysis of these situations is called linear programming; the techniques
used can be divided into three main groups according to the
methods used for solution.
Assignment Problems


This group comprises problems where we are given a matrix of 
effectiveness, showing what happens when we associate each of a num- 
ber of “origins” with each of the same number of “destinations.’’ Each 
origin is to be associated with one and only one destination, and we 
wish to make the associations in such a way as to minimize (maximize) 
the summed effectiveness. The previously cited example of awarding 
ten contracts to ten contractors is such a problem. 


Transportation Problems 


This class of problems is a generalization of the type described in the 
first group. The matrix of effectiveness is no longer necessarily square; 
it shows what happens to effectiveness when we associate each of a 
number of origins with each of a possibly different number of destina-
tions. We are given the total movement from each origin, and the
total movement to each destination, and we wish to determine how the
associations should be made subject to the limitations on totals.


Simplex Problems 


This group actually includes the first two groups, but the computa-
tions are so lengthy that the methods of groups 1 and 2 should be used
whenever possible. The type of problem that can be solved by the
simplex method is best illustrated by an example, and we will not give
a formal mathematical statement of the problem at this stage.

Suppose that a machine can make 400 items of product X per week 
and 300 of product Y. If production is changed from X to Y or vice 
versa during the week, a total of 600 items can be made. The profit 
from sales of X is $2 per item and the profit from Y is $5 per item. We 
wish to determine the amount x to be made of product X, and the 
amount y to be made of product Y, so that profit will be maximized. 

The mathematical statement is that we wish to maximize the expression

    2x + 5y

subject to the restrictions

    0 ≤ x ≤ 400,    0 ≤ y ≤ 300,    x + y ≤ 600

Generally, a simplex problem involves the maximization or minimization
of a linear function of a set of non-negative variables, subject to a
set of linear inequalities relating the variables.
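
With only two variables, the small problem above can be solved by listing the corner points of the feasible region and comparing profits. The sketch below does exactly that; it is not the simplex method itself, only a check that relies on the fact that a linear function attains its maximum over such a region at a corner. The names `constraints` and `vertices` are ours.

```python
from fractions import Fraction
from itertools import combinations

# Each restriction is written as a*x + b*y <= c.
constraints = [
    (1, 0, 400),    # x <= 400
    (0, 1, 300),    # y <= 300
    (1, 1, 600),    # x + y <= 600
    (-1, 0, 0),     # x >= 0
    (0, -1, 0),     # y >= 0
]

def vertices(constraints):
    """Intersections of pairs of boundary lines that satisfy every restriction."""
    points = set()
    for (a1, b1, c1), (a2, b2, c2) in combinations(constraints, 2):
        det = a1 * b2 - a2 * b1
        if det == 0:
            continue                      # parallel boundaries
        x = Fraction(c1 * b2 - c2 * b1, det)
        y = Fraction(a1 * c2 - a2 * c1, det)
        if all(a * x + b * y <= c for a, b, c in constraints):
            points.add((x, y))
    return points

def profit(point):
    return 2 * point[0] + 5 * point[1]

best = max(vertices(constraints), key=profit)
print(best, profit(best))
```

For these data the best corner is x = 300, y = 300, with a profit of $2100.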

It is not the purpose of this chapter to give detailed justification of
the methods used in solving linear programming problems. We will
outline some of the possible techniques and endeavor to make them
sound plausible. It is well to bear in mind that, even with the tech-
niques to be described, many practical problems involve so many vari-
ables that they are beyond the capacity of clerks with desk computers.
It is probably not mere chance that linear programming methods have
been developed at the same time that electronic computers have be-
come available.


THE ASSIGNMENT PROBLEM 


The assignment problem is a type of allocation problem in which n
items are distributed among n boxes, one item to a box, in such a way 
that the return resulting from the distribution is optimized. For 
example, a department head may have five people available for assign- 
ment and five jobs to fill; which assignments will be best for the com- 
pany? A trucking company has an empty truck in each of cities 1, 2, 3, 
4, 5, 6 and needs an empty truck in each of cities 7, 8, 9, 10, 11, 12; how 
should the trucks be dispatched so as to minimize the total mileage? 

The problem may be stated formally as follows. Given an n-by-n
array of real numbers [c_ij], where c_ij is the individual return associated
with assigning the ith item to the jth box. Find, among all permutations
(i_1, i_2, ..., i_n) of the set of integers (1, 2, ..., n), that permutation
for which

    c_{i_1 1} + c_{i_2 2} + ... + c_{i_n n}

takes its maximum (minimum) value.

There are n! such permutations (i.e. n! ways of assigning n items to 
n boxes). The method for choosing the optimal permutation or 
“assignment” is best explained with examples. 


Example 1 
A department head has four subordinates, and four tasks to be
performed. The subordinates differ in efficiency, and the tasks differ
in their intrinsic difficulty. His estimate of the times each man would
take to perform each task is given in the effectiveness matrix below.
How should the tasks be allocated, one to a man, so as to minimize the
total man-hours?

                    Man
               I    II   III   IV
          A    8    26    17   11
   Task   B   13    28     4   26
          C   38    19    18   15
          D   19    26    24   10
SOLUTION

There are 4! = 24 possible sets of associations that satisfy these 
conditions. The reader can write down all possible sets, together with 
the corresponding total man-hours. 

A more systematic procedure is as follows: 

We choose the smallest number in row A and subtract it from each 
element in the row. The result is: 


     0    18     9     3
    13    28     4    26
    38    19    18    15
    19    26    24    10


Now suppose that we have assigned one task to each man. Whatever 
assignment we have made, the total man-hours for the new matrix will 
be 8 less than for the old matrix. Thus an assignment that minimizes 
the total for one matrix also minimizes the total for the other. 

This statement is generally true; the reader should verify the follow- 
ing theorem, which is the basis for our solution. 


Theorem 


If in an assignment problem we add a constant to every element of a 
row (or column) in the effectiveness matrix, then an assignment that 
minimizes the total effectiveness in one matrix also minimizes the total 
effectiveness in the other matrix. 


We now proceed to subtract the minimum element in each row from 
all the elements in its row, yielding 


     0    18     9     3
     9    24     0    22
    23     4     3     0
     9    16    14     0


Next we subtract the minimum element in each column from all the 
elements in its column, obtaining 


     0    14     9     3
     9    20     0    22
    23     0     3     0
     9    12    14     0


Our final step depends on the obvious fact that, so long as our matrix 
consists of positive or zero elements, the total effectiveness cannot be 
negative for any assignment. Consequently, if we can choose an assign- 
ment that has a zero total, there cannot be an assignment with a lower 
total. In other words the total is certain to be minimum if all assign- 
ments can be made to positions where elements are zero. 

Thus we make the following assignments: 


    A-I,    B-III,    C-II,    D-IV


In view of the theorem these assignments will also minimize the total 
number of man-hours for the original matrix. 
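
The reduction and the final assignment can be checked mechanically. In the sketch below the matrix `times` is our reading of the example (row A is inferred from the reduced matrices shown above), and the helper names are ours. The brute-force search over all 4! permutations stands in for the enumeration mentioned at the start of the solution.

```python
from itertools import permutations

times = [
    [8, 26, 17, 11],    # task A, men I-IV (row inferred from the reduced matrix)
    [13, 28, 4, 26],    # task B
    [38, 19, 18, 15],   # task C
    [19, 26, 24, 10],   # task D
]

def reduce_matrix(c):
    """Subtract each row minimum, then each column minimum."""
    rows = [[v - min(row) for v in row] for row in c]
    col_min = [min(col) for col in zip(*rows)]
    return [[v - m for v, m in zip(row, col_min)] for row in rows]

def total(c, perm):
    return sum(c[i][perm[i]] for i in range(len(c)))

def best_assignment(c):
    """Enumerate every assignment and keep the one with the smallest total."""
    return min(permutations(range(len(c))), key=lambda p: total(c, p))

reduced = reduce_matrix(times)
perm = best_assignment(times)
print(reduced)
print(perm, total(times, perm), total(reduced, perm))
```

The permutation (0, 2, 1, 3) corresponds to A-I, B-III, C-II, D-IV; it has a zero total in the reduced matrix and 41 man-hours in the original one, in agreement with the theorem.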


In the example above, the reduction of the effectiveness matrix by 
subtraction resulted in an obvious solution to the original problem. 
We cannot always count on being so fortunate; all we can be sure of, 
after performing the reduction, is that we will end up with a matrix 
possessing at least one zero in each row and at least one zero in each 
column. A complete assignment may not be present among the zeros 
at this stage; or, if it is present, it may be difficult to recognize if the 
matrix is of large dimensionality. Thus two questions remain: 

1. Given a matrix with some zero and all non-negative elements, 
how do we look for the maximal existing assignment among the zeros? 

2. If a complete assignment does not exist among the zeros, how 
can we further modify the matrix by additions (subtractions) to rows 
or columns so as to obtain more zeros? 

Although algorithms exist which give a complete answer to the first 
question, they are too complex for inclusion here. For matrices of 
small dimensionality, the maximal existing assignment can usually be 
found by inspection or trial and error. With matrices of higher dimen- 
sionality, some degree of ingenuity or luck may be needed to find the 
maximal assignment in the absence of a formal algorithm. In all cases 
one would start by using the following rules: 

(a) Examine rows successively until a row with exactly one unmarked
zero is found. Mark (□) this zero, as an assignment will be made there.
Mark (X) all other zeros in the same column to show that they cannot
be used to make other assignments. Proceed in this fashion until all
rows have been examined.

(b) Next, examine columns for single unmarked zeros, marking them
(□) and also marking with an (X) any other unmarked zeros in their
rows.

(c) Repeat (a) and (b) successively until one of two things occurs:
(1) there are no zeros left unmarked, or (2) the remaining unmarked
zeros lie at least two in each row and column.


133. Operations Research: Methods and Problems 


In outcome (1) we have a maximal assignment. In outcome (2) we
must use ingenuity and/or trial and error in order to build up to a
maximal assignment if we are to avoid the use of a highly complex
algorithm. Regardless of how the maximal assignment is obtained, we
can say that, if it has an assignment in every row, this maximal assign-
ment is a complete solution to the original problem. If it does not con-
tain an assignment in every row, we are faced with the question of
modifying the effectiveness matrix by addition or subtraction. Before
we discuss this, we give an example of finding maximal assignments.


Example 2 

Given an effectiveness matrix with zero elements in the positions
shown, and positive non-zero elements elsewhere (shown here as dots),
show how to obtain a maximal assignment.

     .    0    .    .    .
     0    0    .    .    .
     0    .    0    .    0
     0    .    0    .    0
     .    .    0    0    0

SOLUTION

Row 1 has a single zero in column 2. Make an assignment, and
delete the second zero in column 2.

     .   [0]   .    .    .
     0    X    .    .    .
     0    .    0    .    0
     0    .    0    .    0
     .    .    0    0    0

Row 2 now has a single zero in the first column. Make an assignment,
and delete the remaining zeros in column 1.

     .   [0]   .    .    .
    [0]   X    .    .    .
     X    .    0    .    0
     X    .    0    .    0
     .    .    0    0    0

All the remaining rows have at least two zeros left; so we now ex-
amine columns. Column 4 has a single zero in row 5; so we make an
assignment and delete the remaining zeros in row 5.

     .   [0]   .    .    .
    [0]   X    .    .    .
     X    .    0    .    0
     X    .    0    .    0
     .    .    X   [0]   X

Both the remaining rows and columns have two zeros. Make an
assignment in the position (3, 3), and delete the remaining zeros in row
3 and column 3. This leaves one zero at (4, 5) and we make the last
assignment.

     .   [0]   .    .    .
    [0]   X    .    .    .
     X    .   [0]   .    X
     X    .    X    .   [0]
     .    .    X   [0]   X

We see there are no remaining zeros, and every row has an assign-
ment. Since no two assignments are in the same column (they cannot
be if the procedure has been correctly followed), the maximal assign-
ment is a solution.
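
Rules (a) through (c) can be sketched as a short routine. The version below works only on the positions of the zeros, numbered from 0, and uses the zero pattern of Example 2 as we read it. When the rules stall (outcome 2), it simply tries the first remaining zero; that tie-break is our assumption and stands in for the "ingenuity and/or trial and error" of the text, so the routine is not guaranteed to find a maximal assignment in every case.

```python
def mark_assignment(zeros, n):
    """Apply rules (a)-(c); return a dict mapping row -> assigned column."""
    free = set(zeros)        # zeros neither assigned nor crossed out
    assigned = {}

    def assign(r, c):
        assigned[r] = c
        # The assigned zero and every other zero in its row or column are used up.
        for cell in [z for z in free if z[0] == r or z[1] == c]:
            free.discard(cell)

    while free:
        progress = False
        for r in range(n):                       # rule (a): rows with a single zero
            cells = [z for z in free if z[0] == r]
            if len(cells) == 1:
                assign(*cells[0])
                progress = True
        for c in range(n):                       # rule (b): columns with a single zero
            cells = [z for z in free if z[1] == c]
            if len(cells) == 1:
                assign(*cells[0])
                progress = True
        if not progress:                         # outcome (2): trial step (assumption)
            assign(*min(free))
    return assigned

# Zero positions of Example 2, rows and columns counted from 0.
zeros = {(0, 1), (1, 0), (1, 1),
         (2, 0), (2, 2), (2, 4),
         (3, 0), (3, 2), (3, 4),
         (4, 2), (4, 3), (4, 4)}
result = mark_assignment(zeros, 5)
print(sorted(result.items()))
```

Counting from 1, the routine reproduces the assignments (1, 2), (2, 1), (3, 3), (4, 5), (5, 4) of the example.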

Two remarks should be made at this point, concerning the operation
of this procedure in practice. First, there may be more than one
maximal assignment. Second, it is not necessary in practice to rewrite
the matrix repeatedly; this was done in the example only to give clarity
to the presentation.


We now return to our other question: if the maximal assignment does
not constitute a complete solution, how should we add further zeros?
We will give the following rules, and hasten to add (without proof) that
it can be shown that their repeated application must lead to an optimal
assignment in a finite number of iterations. Starting with a maximal
assignment:

(a) Mark all rows for which assignments have not been made.

(b) Mark columns not already marked which have zeros in marked
rows.

(c) Mark rows not already marked which have assignments in
marked columns.

(d) Repeat steps b and c until the chain of markings ends.

(e) Draw lines through all unmarked rows and through all marked
columns.

[If the procedures have been carried out correctly, there should be
as many lines as there were assignments in the maximal assignment, and
every zero will have at least one line through it. Moreover, this method
yields the minimum number of lines that will pass through all zeros.]

(f) Having drawn the set of lines in steps a through e, examine the 
elements that do not have a line through them. Select the smallest of 
these, and subtract it from all the elements that do not have a line 
through them. Add this smallest element to every element that lies 
at the intersection of two lines. Leave the remaining elements of the 
matrix unchanged. 

Example 3 


Show how to construct the minimum number of lines that will pass 
through all the zeros of the matrix below. 


SOLUTION
We first mark the maximal assignment (dots stand for the positive
elements).

     .   [0]   .    .    .
     X    .    .    X    .      ✓
     .    .   [0]   X    X
    [0]   .    .    .    .      ✓
     .    .    .   [0]   .      ✓

     ✓              ✓

Then we mark row 2 as having no assignment and columns 1 and 4 as
having zeros in row 2.
Next mark rows 4 and 5 because they contain assignments in marked
columns. The procedure leads to no further marked rows or marked
columns.

The minimum set of lines that will cover all zeros is the set through
rows 1 and 3 (unmarked) and columns 1 and 4 (marked).
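
Steps (a) through (e) of the line-drawing rules translate directly into a marking loop. The sketch below uses the zero pattern and maximal assignment of this example as we read them from Examples 3 and 4, with rows and columns counted from 0; the function name `covering_lines` is ours.

```python
def covering_lines(zeros, assigned, n):
    """Return (rows with a line, columns with a line) for a maximal assignment."""
    marked_rows = {r for r in range(n) if r not in assigned}        # step (a)
    marked_cols = set()
    changed = True
    while changed:                                                   # step (d)
        changed = False
        for r, c in zeros:                                           # step (b)
            if r in marked_rows and c not in marked_cols:
                marked_cols.add(c)
                changed = True
        for r, c in assigned.items():                                # step (c)
            if c in marked_cols and r not in marked_rows:
                marked_rows.add(r)
                changed = True
    line_rows = set(range(n)) - marked_rows                          # step (e)
    return line_rows, marked_cols

zeros = {(0, 1), (1, 0), (1, 3), (2, 2), (2, 3), (2, 4), (3, 0), (4, 3)}
assigned = {0: 1, 2: 2, 3: 0, 4: 3}      # row -> column; row 1 has no assignment
line_rows, line_cols = covering_lines(zeros, assigned, 5)
print(sorted(line_rows), sorted(line_cols))
```

Counting from 1, the lines go through rows 1 and 3 and columns 1 and 4: four lines for four assignments, with every zero covered.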


Example 4 


Modify the matrix below so as to obtain a better maximal assign- 
ment. 


     5     0     8    10    11
OG. to Ung 
     8     5     0     0     0
     0     6     4     2     7
     3     5     6     0     8


SOLUTION

The zeros are in the same position as in the previous example; so we 
already have the maximal assignment and the lines. These are as 
follows: 


Select the smallest element not deleted by a line; in this matrix it is 
3 in row 2, column 5; subtract this element from every element that 
does not have a line through it, and add it to every element that lies 
at an intersection of two lines. 

The new matrix is: 


     8    [0]    8    13    11
wsp XK 
    11     5    [0]    3     X
    [0]    3     1     2     4
     3     2     3    [0]    5

and we now find that we have a complete assignment in positions with
zero elements [(1, 2); (2, 5); (3, 3); (4, 1); (5, 4)]. If the maximal
assignment did not constitute a solution to the original problem, we
would proceed to draw lines and continue to iterate until we finally
obtained a solution.
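
Step f is easy to express in code. The sketch below applies it to a matrix with the zero pattern of this example, using lines through rows 1 and 3 and columns 1 and 4. Rows 1, 3, and 5 are taken from the example, row 4 is inferred from the modified matrix, and the second and third entries of row 2 are placeholders chosen by us for illustration (any values of 3 or more give the same pattern). The function name `adjust` is ours.

```python
def adjust(matrix, line_rows, line_cols):
    """Subtract the smallest uncovered element from uncovered cells and
    add it to cells lying at the intersection of two lines."""
    n = len(matrix)
    smallest = min(matrix[r][c] for r in range(n) for c in range(n)
                   if r not in line_rows and c not in line_cols)
    result = []
    for r in range(n):
        row = []
        for c in range(n):
            v = matrix[r][c]
            if r not in line_rows and c not in line_cols:
                v -= smallest          # uncovered element
            elif r in line_rows and c in line_cols:
                v += smallest          # intersection of two lines
            row.append(v)
        result.append(row)
    return result

matrix = [
    [5, 0, 8, 10, 11],
    [0, 6, 10, 0, 3],    # entries 6 and 10 are illustrative placeholders
    [8, 5, 0, 0, 0],
    [0, 6, 4, 2, 7],     # inferred from the modified matrix
    [3, 5, 6, 0, 8],
]
new_matrix = adjust(matrix, line_rows={0, 2}, line_cols={0, 3})
for row in new_matrix:
    print(row)
```

The new matrix has zeros at (1, 2), (2, 5), (3, 3), (4, 1), and (5, 4), counting from 1, which is the complete assignment found in the example.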


It is not essential to follow the above procedures exactly in drawing 
lines. In fact so long as we can cover all zeros in an n-by-n matrix with 
fewer than n lines (horizontal or vertical) it can be shown that the 
maximal assignment is not yet a complete solution. The disposition 
of the zeros can always be improved by applying step f above
(subtracting the minimum element) to any set of fewer than n lines cover-
ing all zeros. However, if it is possible to obtain a set with fewer lines,
the final result may be obtained with fewer iterations. 

No general statement can be made as to whether the extra work of 
finding the maximal assignment at each stage can be justified by the 
savings in iteration. It appears to be a matter of personal taste, and 
the reader who prefers may choose to draw any covering set of less than 
n lines (preferably as few as possible) and use them for step f instead of 
the minimum set. 

It can be shown that, if it is impossible to cover the zeros with less
than n lines, then a complete assignment among the zeros is possible 
at that stage. 

Summarizing, we have the following rules for solving the assignment 
problem. 


(a) Modify the effectiveness matrix by subtracting the minimum 
element of each row from all the elements of the row. Then modify the 
resulting matrix by subtracting the minimum element of each column 
from all the elements of the column, obtaining the first reduced matrix. 

(b) Find a maximal assignment for the first reduced matrix. If this 
maximal assignment is a complete solution, the procedure ends at this 
point. If the maximal assignment is not a complete solution proceed 
to step c. 

(c) Draw lines to cover all the zeros. Subtract the smallest element 
not covered by a line from all the elements not covered; and add this 
smallest element to all elements lying at the intersection of two lines, 
obtaining the second reduced matrix. 

(d) Repeat steps b and c on the second reduced matrix, and so on
until a complete solution is obtained.


PROBLEMS FOR SOLUTION 


Set X 


1. A national car-rental service has a surplus of one car in each of cities
1, 2, 3, 4, 5, 6, and a deficit of one car in each of cities 7, 8, 9, 10, 11, 12. The 
distances between cities with a surplus and cities with a deficit are displayed 
in the matrix below. How should the cars be dispatched so as to minimize 
the total mileage traveled? 

To 


78 9 10 11 12 


From 


On Rm wo we He 


2. A small airline, operating seven days a week, serves three cities (A, B, and
C) according to the schedule shown in Table 1. The layover cost per stop is
roughly proportional to the square of the layover time. How should planes be
assigned to flights so as to minimize total layover cost?

TABLE 1

Flight              Time of                  Time of
  no.     From     departure       To        arrival
   1       A        9:00 am        B         Noon
   2       A       10:00 am        B         1:00 pm
   3       A        3:00 pm        B         6:00 pm
   4       A        8:00 pm        C         Midnight
   5       A       10:00 pm        C         2:00 am
   6       B        4:00 am        A         7:00 am
   7       B       11:00 am        A         2:00 pm
   8       B        3:00 pm        A         6:00 pm
   9       C        7:00 am        A        11:00 am
  10       C        3:00 pm        A         7:00 pm


3. Solve Example 1 of the text, by enumeration. 


4. Prove the theorem quoted in the text: If in an assignment problem we
add a constant to every element of a row in the effectiveness matrix, then
an assignment that minimizes the total effectiveness in one matrix also min-
imizes the total effectiveness in the other matrix.


5. A company has four territories open, and four salesmen available for as- 
signment. The territories are not equally rich in their sales potential; it is 
estimated that a typical salesman operating in each territory would bring in 
the following annual sales: 


Territory I: $60,000. 
Territory II: 50,000. 
Territory III: 40,000. 
Territory IV: 30,000. 


The four salesmen are also considered to differ in ability; it is estimated that,
working under the same conditions, their yearly sales would be proportionately
as follows:


Salesman A: 7.
Salesman B: 5.
Salesman C: 5.
Salesman D: 4.


If the criterion is maximum expected total sales, the intuitive answer is to
assign the best salesman to the richest territory, the next best salesman to the
next richest territory, and so on. Verify this answer by the assignment
method.

(Hint: To find an assignment with maximum value, multiply each of the 
elements in the effectiveness matrix by —1, and solve the latter assignment 
problem in the usual way.) 
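
As a check on the intuitive answer, the 24 possible assignments can simply be enumerated. The sketch below assumes, as the problem suggests, that the sales of a salesman in a territory are proportional to the product of his ability figure and the territory's potential; the variable names are ours. It also illustrates the hint: minimizing over the matrix multiplied by -1 gives the same assignment value as maximizing over the original.

```python
from itertools import permutations

territory = [60, 50, 40, 30]    # potentials of territories I-IV, in $1000
ability = [7, 5, 5, 4]          # relative abilities of salesmen A-D

# sales[i][j]: relative sales of salesman i in territory j (assumed product form)
sales = [[a * t for t in territory] for a in ability]
n = len(sales)

def total(matrix, perm):
    return sum(matrix[i][perm[i]] for i in range(n))

best_total = max(total(sales, p) for p in permutations(range(n)))
intuitive_total = total(sales, (0, 1, 2, 3))     # A-I, B-II, C-III, D-IV

negated = [[-v for v in row] for row in sales]
min_negated = min(total(negated, p) for p in permutations(range(n)))

print(best_total, intuitive_total, min_negated)
```

Because salesmen B and C have equal ability, interchanging their territories gives a second assignment with the same total.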


6. Show that the procedure of subtracting the smallest element not covered 
by a line from all uncovered elements and adding the same element to all 
elements lying at an intersection of two lines results in a matrix with the same 
optimal assignment(s) as the original matrix. [Hint: Use the theorem quoted
in problem 4 above.] 


7. Show that the line-drawing procedure given just after Example 2 in the
text results in a system of lines that pass through all the zeros, and with the
property that every line passes through one and only one assignment. [Hint:
If a row is unmarked, it has a line through it; if a row is marked, there are
vertical lines through all its zeros. This establishes the first statement. To
prove the second statement, note that each unmarked row has an assignment
in an unmarked column, and hence that each horizontal line goes through an
assignment. Assume that there is a vertical line which does not go through
an assignment. By tracing the chain of markings back to the initial
markings, show that the present assignment is not maximal. This contradicts
the assumption of a maximal assignment.]


THE TRANSPORTATION PROBLEM 


The transportation problem can be regarded as a generalization of
the assignment problem. It will be recalled that one interpretation of
the assignment problem is that we have a number of origins each
possessing one item, and the same number of destinations each requir-
ing one item, and are asked to empty the origins and fill the destinations
in such a way that the total effectiveness is optimized. In the transpor-
tation problem, we have m origins, with origin i possessing a_i items, and
n destinations (possibly a different number from m), with destination j
requiring b_j items, and with \sum a_i = \sum b_j. We are given the mn costs
associated with shipping one item from any origin to any destination,
and are asked to empty the origins and fill the destinations in such a
way that the total cost is minimized.

The problem may be stated formally as follows. Given an m-by-n
array of real numbers (c_ij), as well as two sets of positive integers
(a_1, a_2, ..., a_m) and (b_1, b_2, ..., b_n) with

    \sum_{i=1}^{m} a_i = \sum_{j=1}^{n} b_j

Determine, among all m-by-n arrays (x_ij) of non-negative integers
that satisfy

    \sum_{i=1}^{m} x_{ij} = b_j    for each j

as well as

    \sum_{j=1}^{n} x_{ij} = a_i    for each i

that array (x_ij) for which the quantity

    \sum_{i,j} x_{ij} c_{ij}

takes its minimum value. Here c_ij represents the cost associated with
shipping one item from origin i to destination j, and x_ij represents the
allocation from origin i to destination j.

The assignment problem, in this terminology, is seen to be a special
case of the transportation problem in which m = n, all the a_i and b_j
are unity, and each x_ij is limited to one of the two values 0 and 1. In
these circumstances, exactly n of the x_ij can be non-zero, one in each
row of the array and one in each column.

The iterative routine to be presented as the mode of solution of the 
transportation problem involves a succession of “feasible solutions” 
of gradually diminishing cost; eventually the minimum achievable cost 
is reached, and the corresponding feasible solution is optimal. By a 
“feasible solution” we mean a set of individual allocations which
simultaneously removes all the existing surpluses and satisfies all the 
existing deficiencies. The next example shows ways of finding an initial 
feasible solution to be used as the starting point in the algorithm. 


Example 5 

A company has four warehouses and six stores. The warehouses 
altogether have a surplus of 22 units of a given commodity, divided 
among them as follows: 


Warehouse Surplus 
1 5 
2 6 
3 2 
4 9 


The six stores altogether need 22 units of the commodity. Individual 
requirements are: 


Store    Requirements
  1           4
  2           4
  3           6
  4           2
  5           4
  6           2


Costs of shipping one unit of the commodity from warehouse i to store
j are displayed in the following matrix:


Warehouse 


9 
5 
3 
2 


~ wo Ww 


Find feasible (not necessarily optimal) solutions, and the cost associated
with each.
SOLUTION
The first step is to draw up a blank m-by-n matrix complete with
row and column requirements, as follows:

                           Store
                  1    2    3    4    5    6     Available
              1                                      5
 Warehouse    2                                      6
              3                                      2
              4                                      9

 Required         4    4    6    2    4    2


We wish to put a set of allocations in the cells so that row totals and 
column totals will be as indicated. 

One method is to start at the upper left-hand corner, which we may
call cell (1, 1), and allocate as much as possible there: in other words 4.
This satisfies the requirement for column 1, and leaves a surplus of 1
unit for row 1; so we allocate 1 to cell (1, 2). Now the allocations are
complete for column 1 and row 1; and we have a remaining deficiency
of 3 in column 2. Allocate 3 in position (2, 2); now we have columns 1
and 2 complete and a surplus of 3 in row 2. Continuing in this way,
from left to right and top to bottom, we will eventually complete all the
requirements by an allocation in the lower right-hand corner. The
resulting feasible solution is:

                           Store
                  1    2    3    4    5    6     Available
              1   4    1                             5
 Warehouse    2        3    3                        6
              3             2                        2
              4             1    2    4    2         9

 Required         4    4    6    2    4    2
To obtain the cost for the feasible solution, we multiply each individual
allocation by its corresponding unit cost, and add. The resulting cost 
is 139. 
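
The corner-to-corner allocation just described is mechanical enough to be written as a short routine. The sketch below uses the surpluses and requirements of this example; the function name `northwest_corner` is ours, and the unit costs are not used because the allocation does not depend on them.

```python
def northwest_corner(supply, demand):
    """Allocate from the upper left-hand corner, moving right and down."""
    supply, demand = list(supply), list(demand)
    alloc = [[0] * len(demand) for _ in supply]
    i = j = 0
    while i < len(supply) and j < len(demand):
        q = min(supply[i], demand[j])     # as much as possible in cell (i, j)
        alloc[i][j] = q
        supply[i] -= q
        demand[j] -= q
        if supply[i] == 0:
            i += 1                        # row completed, move down
        else:
            j += 1                        # column completed, move right
    return alloc

alloc = northwest_corner([5, 6, 2, 9], [4, 4, 6, 2, 4, 2])
for row in alloc:
    print(row)
```

Multiplying each entry of `alloc` by the corresponding unit cost and adding gives the cost of the feasible solution.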

A better method of finding a feasible solution, in that it usually gives 
a lower beginning cost, is described next. We first write down the cost 
matrix, together with row and column identifications and row and 
column requirements: 


Store 
1 2 3 4 5 6 Available 


1 5 
6 
Warehouse 
3 2 
4 9 


Required 4 4 6 2 4 2 


The next step is to enter the difference between the smallest and second 
smallest elements in each column beneath the corresponding column, 
and the difference between the smallest and second smallest elements 
of each row to the right of the row. These differences are the numbers 
in parentheses in the matrix following this paragraph. The first
individual allocation will be to the smallest cost of a row or the smallest 
cost of a column; we choose that one for which there is the greatest 
penalty for not choosing it. That is, we choose the minimum cost loca- 
tion in that row or column whose corresponding number in parentheses 
is the largest. As 5 is the largest number in parentheses, we choose 
column 6 as the line for the first individual allocation, and allocate as 
much as we can to location (2, 6), the minimum cost location in this 
column. Thus 2 units are allocated to location (2, 6) as indicated by the 
small numeral in the upper left corner of that cell; and this completes 
the allocations for column 6, so that the other allocations in this 
column are zero.
Completes
column 6 


Warehouse 


Required 4 4 6 2 4 2 
© @ @ @ @ vd 


The next step is to write down the shrunken cost matrix comprising 
the rows and columns whose allocations are not yet determined, in- 
cluding revised row and column totals which take into account the 
allocations already made. Now 4 is the largest unit penalty; this leads 
to an allocation in the corresponding minimum cost location in column 
4: namely cell (4,4). The maximum possible allocation is 2; so we 
allocate 2 units to cell (4,4), and 0 units to the remaining cells in 
column 4. 


Store 
1 2 8 4 5 Available 
5 (8) 
4 


Warehouse Completes column 4 


Required 4 4 6 2 4 
(0) (2) (2) @ () 
Next, we write down the new cost matrix with column 4 also deleted,
and proceed as before. The successive resulting matrices are set down 
below. 

[Matrices omitted: successive shrunken cost matrices with unit
penalties. Stores 1, 2, 3, 5 (required 4 4 6 4); then stores 1, 2, 3
(required 4 4 6), where an allocation completes row 2 *; then stores
1, 2, 3 (required 4 4 2), where an allocation completes row 3.]


* The reader may have noticed that the authors have failed to apply
the method correctly. The allocation of 4 should be in row 2, column 2.
The solution then obtained will be found to be optimal by the methods
of Example 6. However, a feasible solution has been obtained and,
despite the error, is made optimal in Example 6.

[Matrices omitted: stores 1, 2, 3 (required 4 2 2, unit penalties
(3) (4) (2)), where an allocation completes column 2; then stores 1
and 3 (required 4 2, unit penalties (3) (2)), where the remaining
allocations complete the matrix.]

Copying the various positive allocations as they occur in the successive 
stages, we obtain as our feasible solution: 


[Matrix omitted: allocations for warehouses 1 to 4 (rows) and stores
1 to 6 (columns).]


The cost for this solution turns out to be 127. 

The unit penalty method above appears to involve a lot of rewriting 
of the cost matrix; but much of the repetition was done for clarity of 
exposition rather than from necessity. In practice, a person can carry 
through the method if he wishes on the original cost matrix entirely, 
crossing out rows and columns as they become completed, and working 
with the remaining matrix after correcting the corresponding row sums, 
column sums, and unit penalties. In any case, the time saved in avoid- 
ing a long set of iterations more than justifies the extra time spent in 
obtaining the initial feasible allocation. 
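
The unit penalty routine described above can be sketched in a few lines of Python. This is only an illustration under stated assumptions: the unit costs, availabilities, and requirements are taken to be the Example 5 figures as they can be inferred from the penalties and costs quoted in this section; the function name `unit_penalty_allocation` and the tie-breaking rule (rows before columns, first line found) are choices made here, not part of the text.

```python
# Assumed Example 5 data: rows are warehouses 1-4, columns are stores 1-6.
COST = [[9, 12, 9, 6, 9, 10],
        [7, 3, 7, 7, 5, 5],
        [6, 5, 9, 11, 3, 11],
        [6, 8, 11, 2, 2, 10]]
AVAILABLE = [5, 6, 2, 9]
REQUIRED = [4, 4, 6, 2, 4, 2]


def unit_penalty_allocation(cost, available, required):
    """Initial feasible solution by the unit penalty method (a sketch)."""
    supply, demand = list(available), list(required)
    rows, cols = set(range(len(supply))), set(range(len(demand)))
    alloc = {}  # (warehouse, store) -> units, 1-based as in the text

    def penalty(values):
        # Difference between the two smallest costs; a lone cost is its own penalty.
        s = sorted(values)
        return s[1] - s[0] if len(s) > 1 else s[0]

    while rows and cols:
        lines = [(penalty([cost[i][j] for j in cols]), "row", i) for i in sorted(rows)]
        lines += [(penalty([cost[i][j] for i in rows]), "col", j) for j in sorted(cols)]
        _, kind, k = max(lines, key=lambda t: t[0])  # first line with the largest penalty
        if kind == "row":
            i = k
            j = min(sorted(cols), key=lambda col: cost[i][col])
        else:
            j = k
            i = min(sorted(rows), key=lambda row: cost[row][j])
        q = min(supply[i], demand[j])  # allocate as much as possible
        alloc[(i + 1, j + 1)] = q
        supply[i] -= q
        demand[j] -= q
        if supply[i] == 0:
            rows.discard(i)  # row completed
        if demand[j] == 0:
            cols.discard(j)  # column completed
    return alloc


alloc = unit_penalty_allocation(COST, AVAILABLE, REQUIRED)
print(alloc, sum(COST[r - 1][s - 1] * q for (r, s), q in alloc.items()))
```

With the assumed data the first two allocations are 2 units to cell (2, 6) and 2 units to cell (4, 4), as in the text. Because the sketch applies the rule consistently at every stage, it places 4 units in cell (2, 2), and its total cost is 112 rather than 127, in line with the footnote's remark.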




Now that we know how to obtain a feasible solution, how do we test 
this solution for optimality? Before we give the answer to this question, 
we must introduce the notion of “independence” for a set of cells in a 
rectangular matrix. A set of allocations comprising a feasible solution 
are said to be in “independent positions” if it is impossible to increase 
or decrease any individual allocation without either changing the posi- 
tions of allocations or violating the row or column restrictions. Both 
feasible solutions obtained in the previous example have this property. 
For example, suppose that, in the feasible solution obtained by the 
first method, the allocation in cell (2, 2) is changed from 3 to 2. Then 
the allocation in cell (1, 2) must change from 1 to 2, to preserve the sum 
for column 2; the allocation in position (1, 1) must decrease from 4 to 3, 
because of the restriction on row 1; and finally an allocation in a new 
position in column 1 is required in order to satisfy the requirement for 
column 1. A simple criterion for independence is that it is impossible 
to travel from any allocation back to itself by a series of alternating 
horizontal and vertical jumps from one occupied cell to another, without 
a direct reversal of route. The occupied cells in the following matrix, 
for example, are not in independent positions because of the “loop” 
connecting cells (2, 2), (2, 3), (3, 3), and (3, 2). 


[Matrix omitted: occupied cells in non-independent positions.]


The optimality test of this section is applicable to any feasible
solution of an m-by-n transportation problem which satisfies the
following two conditions:


1. It consists of exactly m+n-1 individual allocations.
2. These allocations are in independent positions.


(Later we shall explain how any feasible solution can be modified to 
meet these conditions.) 
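
The two conditions can be checked mechanically. The sketch below is one possible way to do it, not a method given in the text: it treats every row and every column as a node and every occupied cell as a link between its row and its column, so that a "loop" in the sense used above appears as soon as a cell joins a row and a column that are already connected. The function names are hypothetical.

```python
def in_independent_positions(cells):
    """True if no closed loop of alternating row/column jumps joins the cells."""
    parent = {}

    def find(x):
        # Union-find root lookup with path halving.
        while parent.setdefault(x, x) != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    for r, c in cells:
        a, b = find(("row", r)), find(("col", c))
        if a == b:  # row r and column c are already linked: this cell closes a loop
            return False
        parent[a] = b
    return True


def is_permissible(cells, m, n):
    """The two conditions required by the optimality test."""
    return len(set(cells)) == m + n - 1 and in_independent_positions(cells)


# The loop quoted in the text: cells (2, 2), (2, 3), (3, 3), (3, 2).
print(in_independent_positions([(2, 2), (2, 3), (3, 3), (3, 2)]))  # False
print(in_independent_positions([(2, 2), (2, 3), (3, 3)]))          # True
```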


The underlying motivation of the optimality test will be indicated
briefly. We start with an arbitrary empty cell (one lacking an
allocation) and allocate +1 unit to this cell. In order to keep the row and
column sums fixed, this perturbation necessarily produces a change of
-1 in some occupied cell of the same row, which in turn requires
another change of +1 in another occupied cell of the corresponding
column, and so on. If the feasible solution satisfies the two
requirements above, this chain of events resulting from the allocation of one
unit to an empty cell is consistent (i.e., we end up with a new feasible
solution which satisfies the restrictions on row and column totals) and
unique (i.e., one and only one such consistent cycle of changes is
possible). The net change in total cost resulting from the perturbation
is called the evaluation for the empty cell in question; it may be regarded
as a sort of first difference, analogous to the first derivative of the
calculus. If the cell evaluation is positive, the perturbation would
increase the cost; if it is negative, it would decrease the cost. Since
there are mn-(m+n-1), or (m-1)(n-1), empty cells, there are
(m-1)(n-1) such cell evaluations which may be calculated. If all the
cell evaluations turn out to be positive or zero, the feasible solution
being tested must be optimal (minimal). Actually, it is not necessary
to compute each cell evaluation individually; the optimality test routine
set down below gives a method for calculating all of them
simultaneously.
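
As a small illustration of the perturbation idea, suppose the empty cell (r, s) is joined to the occupied cells (r, q), (p, q), and (p, s) by a rectangular loop; the labels p and q are introduced here only for the example. Adding one unit at (r, s) forces changes of -1, +1, -1 at the other three corners in turn, so the net change in cost is an alternating sum of the four unit costs:

```latex
\text{evaluation of cell } (r, s) \;=\; c_{rs} - c_{rq} + c_{pq} - c_{ps}
```

For a longer loop the signs continue to alternate from corner to corner in the same way.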


Optimality Test 
Starting with a feasible solution consisting of m+n-1 allocations in
independent positions:


1. Determine a set of m+n numbers

$$u_i,\ i = 1, 2, \dots, m; \qquad v_j,\ j = 1, 2, \dots, n$$

such that, for each occupied cell (r, s),

$$c_{rs} = u_r + v_s$$

(Recall that $c_{rs}$ is the individual cost associated with an allocation from
origin r to destination s.) If the feasible solution has the demanded
properties, this can easily be done by inspection as in Example 6
below.
2. Calculate the cell evaluations $\Delta_{rs}$ for each vacant cell (r, s) by
means of the formula

$$\Delta_{rs} = c_{rs} - (u_r + v_s)$$

(See Problem 3 of Set XI for a suggestion of the mode of proof of this
formula.)
3. Examine the matrix of cell evaluations for negative entries. If
none are negative, the solution under test is optimal. If none are
negative and any are zero, other optimal solutions exist. If any are
negative, the solution under test is not optimal.
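
The three steps of the test can be sketched as follows. The sketch rests on assumptions that should be read as such: the unit costs are the Example 5 figures as inferred from this section, and the occupied cells are those of the cost-127 solution tested in Example 6 below, as inferred from the u and v values quoted there. The function names are hypothetical.

```python
# Assumed Example 5 unit costs (warehouses 1-4 by stores 1-6).
COST = [[9, 12, 9, 6, 9, 10],
        [7, 3, 7, 7, 5, 5],
        [6, 5, 9, 11, 3, 11],
        [6, 8, 11, 2, 2, 10]]
# Assumed occupied cells of the solution with cost 127 (1-based).
OCCUPIED = [(1, 1), (1, 3), (2, 3), (2, 6), (3, 2),
            (4, 1), (4, 2), (4, 4), (4, 5)]


def row_and_column_numbers(cost, occupied, start_row=4):
    """Step 1: solve c_rs = u_r + v_s over the occupied cells, with u[start_row] = 0."""
    u, v = {start_row: 0}, {}
    pending = list(occupied)
    while pending:
        progressed = False
        for r, s in list(pending):
            c = cost[r - 1][s - 1]
            if r in u and s not in v:
                v[s] = c - u[r]
            elif s in v and r not in u:
                u[r] = c - v[s]
            elif not (r in u and s in v):
                continue  # neither number known yet; try again on a later pass
            pending.remove((r, s))
            progressed = True
        if not progressed:
            raise ValueError("occupied cells do not link every row and column")
    return u, v


def cell_evaluations(cost, occupied, u, v):
    """Step 2: evaluation c_rs - (u_r + v_s) for every vacant cell."""
    return {(r, s): cost[r - 1][s - 1] - (u[r] + v[s])
            for r in sorted(u) for s in sorted(v) if (r, s) not in occupied}


u, v = row_and_column_numbers(COST, OCCUPIED)
evaluations = cell_evaluations(COST, OCCUPIED, u, v)
# Step 3: any negative evaluation shows the solution is not optimal.
print({cell: d for cell, d in evaluations.items() if d < 0})
```

Starting from a different row or a different value changes the individual u and v but not the sums u + v, so the evaluations are unaffected.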


Example 6 
Test the lower-cost feasible solution obtained in Example 5 for 
optimality. 


SOLUTION


The allocations for the feasible solution with cost 127 fall in the 
following positions: 


The positions are seen to be independent, and there are 9 of them
(m+n-1); so the optimality test routine is applicable. Since the $u_i$
and $v_j$ are to be determined by means of the unit costs in the occupied
cells only, we write in the corresponding unit costs:

We assign a u-value of any particular amount to any particular row;
thus $u_4$ is taken as 0. Since we want $u_i + v_j$ to equal $c_{ij}$ for occupied cells
(i, j), this value for $u_4$ leads to $v_1 = 6$, $v_2 = 8$, $v_4 = 2$, and $v_5 = 2$; and
we have


[Partial table omitted: row 4, with $u_4 = 0$, and the column values 6, 8, 2, 2.]


The subset of the $u_i$ and $v_j$ already determined leads successively to
$u_1 = 3$, $u_3 = -3$, $v_3 = 6$, $u_2 = 1$, $v_6 = 4$. Our complete set of $u_i$ and
$v_j$ becomes


[Table omitted: complete set of $u_i$ and $v_j$.]


(Our starting point could have been any particular value for any
particular one of the $u_i$ or $v_j$; it is slightly more convenient to start with zero
for the row with the most allocations. No matter which row or column
or which number we start with, the mn numbers ($u_i + v_j$) will not
change.)
To compute the matrix of cell evaluations, it is convenient to write
down the matrix of the corresponding unit costs, and the matrix of the
numbers ($u_i + v_j$), and then subtract the latter matrix from the former:


[Matrices omitted: matrix of unit costs (empty cells only); matrix of
($u_i + v_j$); matrix of cell evaluations.]

Since one of the cell evaluations turns out to be negative, we conclude 
that the feasible solution under test is not optimal. We show how to 
obtain an optimal solution in the following example. 
Example 7 

Obtain an optimal solution to the transportation problem stated in 
Example 5, using as a starting point the feasible solution tested in the 
previous example. 
SOLUTION

For convenience we recopy the initial feasible solution:


[Matrix omitted: initial feasible solution, cost 127, with a check mark
in one empty cell.]


The check mark indicates the position of the empty cell for which the
corresponding evaluation turned out to be negative in the last example.

Recall that an evaluation of -N for an empty cell means that an
allocation of +1 unit to that cell will reduce the achieved cost by N.
If it is possible to allocate +2 units to that cell without making any of
the other allocations negative, the achieved cost will be reduced by 2N,
and so on. Since the aim of the iteration is to arrive at the minimum
cost as quickly as possible, we shall allocate as much as possible to the
empty cell with the negative evaluation (or, in general, to the empty
cell with the most negative evaluation).

To perform this re-allocation, we first identify the loop joining the
empty cell to occupied cells, as indicated below. (Recall that a loop is
a closed succession of horizontal and vertical jumps between cells in a
matrix.)


[Matrix omitted: loop joining cell (2, 2) to occupied cells.]


It is seen that we can allocate at most 2 units to cell (2, 2) and still
satisfy the row-and-column-total and non-negativity restrictions on the
allocations. (If we were to allocate 3 units to the empty cell, the net
allocation to cell (4, 2) would have to be -1 if we are to preserve the
totals for row 4 and column 2.) The effect of allocating 2 units to the
empty cell is


[Matrix omitted: re-allocation among cells of loop.]


The four allocations which do not belong to the loop are carried
forward without change, and the new feasible solution becomes


[Matrix omitted: second feasible solution, cost 115.
Required: 4 4 6 2 4 2.]


The cost for this new feasible solution is computed to be 115, which is 
12 less than the cost for the initial solution; a reduction of this size is 
anticipated because we are allocating 2 units to a cell where each unit 
results in a cost change of —6. 
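
The re-allocation step can be followed in a short sketch. It assumes the Example 5 unit costs and the initial allocations as inferred from this section, and it takes the loop through cell (2, 2) as given rather than searching for it; all three are assumptions made for illustration, and the function names are hypothetical.

```python
# Assumed Example 5 unit costs (warehouses 1-4 by stores 1-6).
COST = [[9, 12, 9, 6, 9, 10],
        [7, 3, 7, 7, 5, 5],
        [6, 5, 9, 11, 3, 11],
        [6, 8, 11, 2, 2, 10]]
# Assumed initial feasible solution (cell -> units), 1-based cells.
initial = {(1, 1): 3, (1, 3): 2, (2, 3): 4, (2, 6): 2, (3, 2): 2,
           (4, 1): 1, (4, 2): 2, (4, 4): 2, (4, 5): 4}
# Loop starting at the empty cell; cells alternately gain and lose units.
loop = [(2, 2), (2, 3), (1, 3), (1, 1), (4, 1), (4, 2)]


def total_cost(alloc):
    return sum(COST[r - 1][s - 1] * q for (r, s), q in alloc.items())


def reallocate(alloc, loop):
    """Move as many units as possible into loop[0] without a negative allocation."""
    losing = loop[1::2]
    theta = min(alloc[cell] for cell in losing)
    new = dict(alloc)
    for k, cell in enumerate(loop):
        new[cell] = new.get(cell, 0) + (theta if k % 2 == 0 else -theta)
    # Cells emptied by the move drop out of the solution.
    return {cell: q for cell, q in new.items() if q > 0}, theta


second, theta = reallocate(initial, loop)
print(theta, total_cost(initial), total_cost(second))
```

The alternating sum of unit costs around this loop is 3 - 7 + 9 - 9 + 6 - 8 = -6, which is the cell evaluation, and two units can be moved before cell (4, 2) is emptied.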

The second feasible solution again has 9 allocations in independent 
positions, and the optimality test is applicable. The result is: 


[Matrix omitted: determination of $u_i$ and $v_j$ for the second feasible
solution.]


The solution is still not optimal.
The steps for the next iteration, which turns out to yield an optimal
solution, are:


[Matrix omitted: loop joining cell (3, 1) to occupied cells.]

[Matrix omitted: third feasible solution, cost 112, nine independent
allocations.]

[Matrix omitted: determination of $u_i$ and $v_j$.]

[Matrix of cell evaluations omitted.]

Since all the cell evaluations are non-negative, the third feasible
solution is optimal, and 112 is the corresponding minimum cost.

In the above example, the iterative routine could be carried out
without difficulty, because each successive feasible solution was
“permissible”; i.e., it consisted of m+n-1 allocations in independent
positions. The remainder of this section deals with the modifications
which must be made when, as frequently happens, a feasible solution
at some stage does not satisfy the required conditions.

If a feasible solution has allocations in positions which are not
independent, there will be a loop (or loops) connecting occupied cells. Such
loops can be eliminated by re-allocating among cells in the loop,
as in Example 7; and the result will be a feasible solution in
independent position. Once a feasible solution is put in independent
position, later steps in the iteration will all result in solutions in
independent positions. Thus the independence requirement is not hard to
fulfill. But the requirement concerning the number of allocations is
more difficult to meet, because, even if we start with a permissible
solution, the number of allocations is likely to drop below m+n-1 at
any stage of the iteration.
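
Pulling the steps of Example 7 together, the whole iteration can be sketched as below. The data are assumptions (the Example 5 unit costs and the cost-127 starting solution as inferred from this section), and the function names are hypothetical. One design choice differs from the text: exactly one cell leaves the solution at each step, and any other cell emptied at the same time is kept with a zero allocation, which keeps the count at m+n-1 in a way comparable to an infinitesimal allocation.

```python
COST = [[9, 12, 9, 6, 9, 10],
        [7, 3, 7, 7, 5, 5],
        [6, 5, 9, 11, 3, 11],
        [6, 8, 11, 2, 2, 10]]
START = {(1, 1): 3, (1, 3): 2, (2, 3): 4, (2, 6): 2, (3, 2): 2,
         (4, 1): 1, (4, 2): 2, (4, 4): 2, (4, 5): 4}


def potentials(cost, basis):
    """Numbers u, v with c_rs = u_r + v_s on every cell of the basis (u_1 = 0)."""
    u, v = {1: 0}, {}
    while len(u) + len(v) < len(cost) + len(cost[0]):
        before = len(u) + len(v)
        for r, s in basis:
            if r in u and s not in v:
                v[s] = cost[r - 1][s - 1] - u[r]
            elif s in v and r not in u:
                u[r] = cost[r - 1][s - 1] - v[s]
        if len(u) + len(v) == before:
            raise ValueError("solution is not permissible")
    return u, v


def find_loop(basis, start):
    """Loop from the empty cell through basis cells, moving along a row first."""
    def extend(path):
        along_row = len(path) % 2 == 1
        r, c = path[-1]
        if not along_row and c == start[1] and len(path) >= 4:
            return path  # one vertical jump now returns to the start
        for cell in basis:
            if cell not in path and (cell[0] == r if along_row else cell[1] == c):
                found = extend(path + [cell])
                if found:
                    return found
        return None
    return extend([start])


def improve_to_optimum(cost, alloc):
    alloc, costs_seen = dict(alloc), []
    while True:
        costs_seen.append(sum(cost[r - 1][s - 1] * q for (r, s), q in alloc.items()))
        u, v = potentials(cost, alloc)
        evals = {(r, s): cost[r - 1][s - 1] - u[r] - v[s]
                 for r in u for s in v if (r, s) not in alloc}
        cell, worst = min(evals.items(), key=lambda kv: kv[1])
        if worst >= 0:
            return alloc, costs_seen  # no negative evaluation: optimal
        loop = find_loop(alloc, cell)
        leaving = min(loop[1::2], key=lambda c: alloc[c])
        theta = alloc[leaving]
        for k, c in enumerate(loop):
            alloc[c] = alloc.get(c, 0) + (theta if k % 2 == 0 else -theta)
        del alloc[leaving]


final, costs = improve_to_optimum(COST, START)
print(costs, final)
```

On the assumed data the successive costs are 127, 115, and 112, matching the three feasible solutions of Example 7.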

The following example gives a case where the number of allocations 
drops below the required number after the first stage of the iteration, 
and shows how one can proceed to an optimal solution. 


Example 8
Obtain an optimal solution to the transportation problem stated in
Example 5, using as a starting point the following feasible solution:


[Matrix omitted: initial feasible solution, cost 127.
Available: 5 6 2 9.]


Since the initial solution has nine allocations in independent
positions, we may proceed to test for optimality and iterate as before:


[Matrices omitted: determination of $u_i$ and $v_j$; matrix of cell
evaluations; loop joining cell (2, 6) to occupied cells; re-allocation
among cells of loop.]

[Matrix omitted: second feasible solution, cost 115, only eight
allocations. Required: 4 4 6 2 4 2.]


A feasible solution with independence, but with fewer than the
required number of individual allocations, is altered to become
permissible in the following manner. First, the requisite number of vacant
cells is chosen, so that (a) this number of cells plus the existing number
of allocations is m+n-1, and (b) these m+n-1 cells are independent.
(This can always be done if the solution we start with is in independent
positions.) Then, an infinitesimal but positive allocation, of amount ε,
is given to each of the chosen vacant cells. This fiction does not
appreciably alter the physical nature of the original set of allocations, but
will be found to suffice for carrying through the iterations.

In our example, we need to make but one infinitesimal allocation.
One of the empty cells which would suffice for this is cell (1, 2); we
allocate ε to this cell, and note that the resulting set of nine allocations
are in independent positions.
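
Choosing the vacant cells for the infinitesimal allocations can also be done mechanically. The sketch below is one possible procedure, not the text's: it scans the vacant cells row by row and accepts a cell whenever it joins a row and a column not yet linked, which guarantees that no loop is created. The eight-cell pattern used to try it out is an assumed illustration for a 4-by-6 problem, not the second feasible solution of Example 8, and the function name is hypothetical.

```python
def epsilon_cells(cells, m, n):
    """Vacant cells that bring an independent set of cells up to m+n-1 cells."""
    parent = {}

    def find(x):
        while parent.setdefault(x, x) != x:
            parent[x] = parent[parent[x]]
            x = parent[x]
        return x

    def link(r, c):
        # Join row r and column c; refuse if that would close a loop.
        a, b = find(("row", r)), find(("col", c))
        if a == b:
            return False
        parent[a] = b
        return True

    for r, c in cells:
        if not link(r, c):
            raise ValueError("the given cells are not in independent positions")
    extra = []
    for r in range(1, m + 1):
        for c in range(1, n + 1):
            if len(cells) + len(extra) == m + n - 1:
                return extra
            if (r, c) not in cells and link(r, c):
                extra.append((r, c))
    return extra


# An assumed degenerate pattern with eight occupied cells (m = 4, n = 6).
occupied = [(1, 3), (2, 2), (2, 6), (3, 1), (3, 3), (4, 1), (4, 4), (4, 5)]
print(epsilon_cells(occupied, 4, 6))
```

For this pattern cell (1, 1) is rejected, because row 1 and column 1 are already linked through cells (1, 3), (3, 3), and (3, 1), and cell (1, 2) is accepted.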


[Matrix omitted: revised second feasible solution.]


Now that we have a permissible set of allocations, we can apply the
optimality test and iterate:


[Matrices omitted: determination of $u_i$ and $v_j$; matrix of cell
evaluations; loop joining cell (1, 3) to occupied cells; re-allocation
among cells of loop.]

[Matrix omitted: third feasible solution, cost 112, nine independent
allocations. Required: 4 4 6 2 4 2.]

[Matrices omitted: determination of $u_i$ and $v_j$; matrix of cell
evaluations; loop joining cell (2, 3) to occupied cells.]


At this point we note that the maximum amount we can move into
the designated cell is the infinitesimal amount ε. We do this, even
though the new solution will have the same achieved cost as the old
one, because it places us in a position to carry out later iterations if
needed.


[Matrix omitted: re-allocation among cells of loop (ignore infinitesimal
changes in non-infinitesimal allocations).]

[Matrix omitted: fourth feasible solution, cost 112, nine independent
allocations.]

[Matrices omitted: determination of $u_i$ and $v_j$; matrix of cell
evaluations, which indicates that the fourth feasible solution is
optimal.]


Inasmuch as the infinitesimal quantity ε plays only an auxiliary role,
it is removed when the optimum is reached, and our final answer is:


[Matrix omitted: optimal solution, cost 112.]


Note that the optimal solution is somewhat different from that
reached in the previous example, though of course both achieve the
same minimum cost.

It is quite possible that a feasible solution may degenerate to m+n-3
or even fewer independent allocations (compare any assignment
problem, where any feasible solution consists of exactly n allocations).
In such cases, if the transportation method of this section is to be used
in solving the problem, two or more ε's must be used. The ε's placed
in various independent locations are distinguished from each other by
subscripts, and some convention is made about their relative
magnitudes.

A rule that will suffice is to read across the rows from left to right,
starting with the top row, and call the first ε encountered the smallest,
the next one the second smallest, and so on.


PROBLEMS FOR SOLUTION 


Set XI 


1. Solve the following transportation problems. (Cell entries represent unit
costs.)


(a) [Cost matrix omitted.]

(b) [Cost matrix omitted.]

(c) [Cost matrix omitted. Available: 5 2 3 1 2 2.]


2. A department store wishes to purchase the following quantities of ladies'
dresses.

    Dress type      A     B     C     D     E
    Quantity      150   100    75   250   200


Tenders are submitted by 4 different manufacturers who undertake to supply
not more than the quantities below (all types of dress combined).


    Manufacturer      W     X     Y     Z
    Total quantity  300   250   150   200


The store estimates that its profit per dress will vary with the manufacturer
as shown in the matrix below. How should orders be placed?


                            Dress
                     A      B      C      D      E
                W   2.75   3.50   4.25   2.25   1.50
    Manufacturer X  3.00   3.25   4.50   1.75   1.00
                Y   2.50   3.50   4.75   2.00   1.25
                Z   3.25   2.75   4.00   2.50   1.75


[Hint: Add a “dummy dress” for which there will be no profit, to represent
unused capacity of manufacturers.]

3. Justify the formula given in the text for the simultaneous calculation
of all the cell evaluations. That is, if we have a feasible solution consisting
of m+n-1 independent allocations, and if we have the $u_i$ and $v_j$ satisfying

$$u_r + v_s = c_{rs} \quad \text{for each filled cell } (r, s)$$

show that the evaluation $\Delta_{ij}$ corresponding to each empty cell (i, j) is given by

$$\Delta_{ij} = c_{ij} - (u_i + v_j)$$

[Hint: First verify the formula for the simple case where the empty cell is
connected to filled cells by a square loop as below. This is done by allocating
+1 unit to cell (i, j), and -1 unit to the next cell occupying a corner of the
loop, and so on, and computing the resulting change in cost.

[Figure omitted: square loop joining an empty cell to filled cells.]

Then generalize your proof to the case of a loop of arbitrary shape connecting
cell (i, j) to filled cells.]


THE LINEAR PROGRAMMING PROBLEM 


In this type of problem we wish to maximize the linear function

(8.1)
$$Z = C_1 x_1 + C_2 x_2 + \cdots + C_k x_k$$

over all those points in the region

(8.2)
$$x_1 \ge 0,\ x_2 \ge 0,\ \dots,\ x_k \ge 0$$

which also satisfy the m linear inequalities

(8.3)
$$\begin{aligned}
a_{11} x_1 + a_{12} x_2 + \cdots + a_{1k} x_k &\le b_1\\
a_{21} x_1 + a_{22} x_2 + \cdots + a_{2k} x_k &\le b_2\\
&\ \,\vdots\\
a_{m1} x_1 + a_{m2} x_2 + \cdots + a_{mk} x_k &\le b_m
\end{aligned}$$

We can assume without loss of generality that the m expressions
comprising the left-hand side in (8.3) are linearly independent, and also
that each restriction is satisfied by some but not all points ($x_1, x_2,
\dots, x_k$) with non-negative coordinates. The latter assumption means
that, for each restriction (say, the rth), at least one coefficient $a_{rj}$ has
the same sign as $b_r$.

In this section and most of the next, we shall limit ourselves to the
case where all the $b_i$ are non-negative. Later, in Example 11, this
limitation will be abandoned. Problem 3 of Set XII gives some
discussion of the case where some of the restrictions are equations rather
than inequalities.

If Z is a function of only one variable x, the problem of maximizing
Z is trivial. In the cases where Z is a function of two or three variables,
there is a geometric solution to the problem, which parallels the general
algebraic method to be discussed in the next section. We present the
geometric approach through a numerical example.


Example 9 
Find the non-negative values of x, y, z that maximize the expression

$$Z = 3x + 5y + 4z$$

subject to the restrictions

(8.4)
$$\begin{aligned}
2x + 3y &\le 8\\
2y + 5z &\le 10\\
3x + 2y + 4z &\le 15
\end{aligned}$$


SOLUTION

If we consider x, y, and z to be coordinates in three-dimensional
space, the non-negativity restrictions mean that we are concerned only
with the region consisting of points whose coordinates are all either
positive or zero. However, we are not free to choose any point in this
region because of the restrictions (8.4). In fact we are limited to points
that lie within or on the boundary of the solid enclosed by the six planes,
whose equations are

        2x + 3y = 8             x = 0
        2y + 5z = 10            y = 0
        3x + 2y + 4z = 15       z = 0


This solid [OABCDEFG] is shown in Figure 8.1. Now consider the
plane whose equation is

(8.5)
$$\frac{3}{\sqrt{50}}\,x + \frac{5}{\sqrt{50}}\,y + \frac{4}{\sqrt{50}}\,z = p$$

It is shown in textbooks on analytical geometry that p is the
perpendicular distance of the plane from the origin. Now $Z = p\sqrt{50}$, so
that Z has the same value for any point of the plane (8.5). If we think
of p as a variable parameter, (8.5) will represent a family of parallel
planes, and maximizing Z is equivalent to finding that member of
the family which is furthest from the origin, while still having at least
one point within or on the boundary of the solid OABCDEFG.

Figure 8.1. Region of admissible solutions in three dimensions (Example 9).

A little reflection will show that such a plane must pass through one
of the corners of the solid OABCDEFG. Consider Figure 8.2, where P
is any corner, and JHK is one plane of the family. It is clear that the
parallel plane J'H'K' still has points within the solid and that it is
further from the origin. Thus, until we move J'H'K' as far as P, there
will be parallel planes further from the origin with points within the
solid. Of course, there may be other parallel planes further from the
origin and with a point within OABCDEFG, but the plane for which
we are looking must pass through some corner. In other words, the
maximum value of Z is taken at one of the corners of the solid bounded by
the restricting planes. This result, or its algebraic analog, is the key
theorem to all that follows.

Figure 8.2. View of one vertex, showing the direction of the
perpendicular from the origin.
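
Because the maximum is taken at a corner, a brute-force check is possible for a problem as small as Example 9: intersect the six bounding planes three at a time, keep the intersection points that satisfy every restriction, and evaluate Z at each. The sketch below does this with exact fractions; the helper names are hypothetical, and Cramer's rule is simply a convenient choice for 3-by-3 systems.

```python
from fractions import Fraction
from itertools import combinations

# Each plane is (a, b, c, d), meaning a*x + b*y + c*z = d.
PLANES = [(2, 3, 0, 8), (0, 2, 5, 10), (3, 2, 4, 15),
          (1, 0, 0, 0), (0, 1, 0, 0), (0, 0, 1, 0)]


def det3(m):
    (a, b, c), (d, e, f), (g, h, i) = m
    return a * (e * i - f * h) - b * (d * i - f * g) + c * (d * h - e * g)


def intersection(three_planes):
    """Common point of three planes by Cramer's rule, or None if there is no unique point."""
    coeffs = [p[:3] for p in three_planes]
    rhs = [p[3] for p in three_planes]
    d = det3(coeffs)
    if d == 0:
        return None
    point = []
    for k in range(3):
        replaced = [row[:k] + (rhs[n],) + row[k + 1:] for n, row in enumerate(coeffs)]
        point.append(Fraction(det3(replaced), d))
    return tuple(point)


def feasible(p):
    x, y, z = p
    return (min(p) >= 0 and 2 * x + 3 * y <= 8 and 2 * y + 5 * z <= 10
            and 3 * x + 2 * y + 4 * z <= 15)


def objective(p):
    return 3 * p[0] + 5 * p[1] + 4 * p[2]


corners = set()
for trio in combinations(PLANES, 3):
    p = intersection(trio)
    if p is not None and feasible(p):
        corners.add(p)

best = max(corners, key=objective)
print(len(corners), best, objective(best))
```

The enumeration finds eight corners, as the lettering OABCDEFG suggests, and the largest Z occurs at (89/41, 50/41, 62/41). The number of plane triples grows quickly with the size of the problem, which is the difficulty the text goes on to address.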

Having decided that we can confine our search for the maximum Z
to corners, we could first find all the corners (as in Figure 8.1) and then
evaluate Z at each, until we found the maximum. Unfortunately,
finding all the corners is a long and tedious process, and we are led to
wonder if a more systematic approach is possible. The most commonly
used approach consists of four steps.

1. Find one corner as a starting point.
2. Examine each edge through the corner to see whether movement
along it increases Z. If movement along no such edge increases Z, the
corner maximizes Z. If movement along at least one edge increases Z,
proceed to step 3.
3. Choose one of the edges along which movement increases Z, and
move along it until the next corner is reached.
4. Repeat steps 1, 2, 3 until Z cannot be increased further.


In order to do this with reasonable algebraic simplicity we introduce
the so-called slack variables u, v, w, defined by the equations

(8.6)
$$\begin{aligned}
2x + 3y + u &= 8\\
2y + 5z + v &= 10\\
3x + 2y + 4z + w &= 15
\end{aligned}$$

It will be realized that u, v, w are the deficiencies when the inequality
signs hold in the original restrictions. Moreover, if x, y, z are to satisfy
these restrictions, u, v, w must be non-negative.

If we rewrite (8.6) as

(8.6A)
$$\begin{aligned}
u &= 8 - 2x - 3y\\
v &= 10 - 2y - 5z\\
w &= 15 - 3x - 2y - 4z
\end{aligned}$$

we see that any two of the six equations u = 0, v = 0, w = 0, x = 0,
y = 0, z = 0 together determine one of the edges of the solid
OABCDEFG. Now suppose we start at the origin, which is clearly a
corner of the solid. Thus we choose x = y = z = 0 and read off the
corresponding values of u, v, w as 8, 10, 15.

The three edges through the origin are represented by the pairs of
equations

        x = 0        x = 0        y = 0
        y = 0        z = 0        z = 0

If we move along these lines, we increase z, y, x, respectively. Each
unit increase in x, y, z increases Z by 3, 5, 4, respectively; so we decide
to increase y as much as possible, and for the time being to leave x and
z as zero.

Referring to (8.6A) we see that we can increase y by 8/3 without
making u negative; we can increase y by 5 without making v negative,
and we can increase y by 15/2 without making w negative. Thus the
maximum increase in y is 8/3, and we move from the origin to the point
B(0, 8/3, 0). Next we re-arrange equations (8.6) so that we can read
off the non-zero variables at B, just as previously we read off the
non-zero variables at the origin. To do this we solve (8.6) for v, w, and y
(the non-zero variables at B) in terms of x, z, and u (the zero variables
at B). An easy way to do this is to solve the first equation of (8.6A)
for y, and then substitute this expression for y in the other two
equations of (8.6A). We obtain

$$\begin{aligned}
\tfrac{2}{3}x + y + \tfrac{1}{3}u &= \tfrac{8}{3}\\
-\tfrac{4}{3}x + 5z - \tfrac{2}{3}u + v &= \tfrac{14}{3}\\
\tfrac{5}{3}x + 4z - \tfrac{2}{3}u + w &= \tfrac{29}{3}
\end{aligned}$$

(8.7) or

$$\begin{aligned}
y &= \tfrac{8}{3} - \tfrac{2}{3}x - \tfrac{1}{3}u\\
v &= \tfrac{14}{3} + \tfrac{4}{3}x - 5z + \tfrac{2}{3}u\\
w &= \tfrac{29}{3} - \tfrac{5}{3}x - 4z + \tfrac{2}{3}u
\end{aligned}$$

The edges through B are given by

        BO: x = 0, z = 0        BG: x = 0, u = 0        BA: z = 0, u = 0

We have just moved along BO; so we consider moving along BG and
BA.


If we move along BA, we will have to increase x, and from the first
equation in (8.7) we see we will have to decrease y by 2/3 for every unit
increase in x. Thus, for every unit increase in x, we will change Z by
3 × 1 - 5 × 2/3 = -1/3. Thus we do not want to move along BA.

If we move along BG, we increase z, and we see from (8.7) that
x and y do not change as z increases. Consequently, increasing z
increases Z by 4 per unit. From the second of equations (8.7), we can
increase z by as much as 14/15 without making v negative; and, from
the third equation, we can increase z by as much as 29/12 without
making w negative. Thus the maximum permissible increase in z is
14/15, and we move from B to G.

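Again we express the non-zero variables (y, z, and w) in terms of the
zero variables

$$\begin{aligned}
\tfrac{2}{3}x + y + \tfrac{1}{3}u &= \tfrac{8}{3}\\
-\tfrac{4}{15}x + z - \tfrac{2}{15}u + \tfrac{1}{5}v &= \tfrac{14}{15}\\
\tfrac{41}{15}x - \tfrac{2}{15}u - \tfrac{4}{5}v + w &= \tfrac{89}{15}
\end{aligned}$$

(8.8) or

$$\begin{aligned}
y &= \tfrac{8}{3} - \tfrac{2}{3}x - \tfrac{1}{3}u\\
z &= \tfrac{14}{15} + \tfrac{4}{15}x + \tfrac{2}{15}u - \tfrac{1}{5}v\\
w &= \tfrac{89}{15} - \tfrac{41}{15}x + \tfrac{2}{15}u + \tfrac{4}{5}v
\end{aligned}$$

and repeat the operations, arriving at the point F with coordinates
(89/41, 50/41, 62/41). Here we discover that movement in any
direction decreases Z, and we conclude that Z takes its maximum at the
vertex F.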
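
The walk from the origin through B and G to F can be reproduced in a short sketch that keeps the equations in solved form and uses exact fractions. It is an illustration rather than the tableau layout developed in the next section: the function name `simplex_walk` is hypothetical, the edge chosen at each corner is the one with the largest gain in Z per unit, and all right-hand sides are assumed non-negative.

```python
from fractions import Fraction

A = [[2, 3, 0], [0, 2, 5], [3, 2, 4]]   # coefficients of x, y, z in (8.4)
b = [8, 10, 15]
c = [3, 5, 4]                           # Z = 3x + 5y + 4z


def simplex_walk(A, b, c):
    """Maximise c.x subject to A x <= b, x >= 0, recording each corner visited."""
    m, k = len(A), len(c)
    # Each row: original coefficients, slack coefficients, right-hand side.
    T = [[Fraction(a) for a in A[i]]
         + [Fraction(int(i == j)) for j in range(m)]
         + [Fraction(b[i])] for i in range(m)]
    gain = [Fraction(cj) for cj in c] + [Fraction(0)] * m
    basis = list(range(k, k + m))       # at the origin the slacks are the non-zero variables
    corners = []
    while True:
        x = [Fraction(0)] * (k + m)
        for i, j in enumerate(basis):
            x[j] = T[i][-1]
        corners.append(tuple(x[:k]))
        # Net change in Z per unit increase of each variable.
        delta = [gain[j] - sum(gain[basis[i]] * T[i][j] for i in range(m))
                 for j in range(k + m)]
        enter = max(range(k + m), key=lambda j: delta[j])
        if delta[enter] <= 0:
            return corners              # no edge increases Z
        ratios = [(T[i][-1] / T[i][enter], i) for i in range(m) if T[i][enter] > 0]
        if not ratios:
            raise ValueError("Z can be increased without limit")
        _, leave = min(ratios)          # first non-zero variable driven to zero
        pivot = T[leave][enter]
        T[leave] = [t / pivot for t in T[leave]]
        for i in range(m):
            if i != leave:
                factor = T[i][enter]
                T[i] = [ti - factor * tl for ti, tl in zip(T[i], T[leave])]
        basis[leave] = enter


path = simplex_walk(A, b, c)
print(path)
```

The corners visited are (0, 0, 0), (0, 8/3, 0), (0, 8/3, 14/15), and (89/41, 50/41, 62/41), that is O, B, G, and F, and the ratios computed at each step are the limits 8/3, 14/15, and 89/41 met in the text.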

The geometric method for the simplex has been described here in
order to motivate the algebraic presentation which follows, and is not
recommended as a calculational procedure even in the
three-dimensional case. From here on, we will be thinking of the algebraic analog
of the geometry already discussed, and will develop the simplex tableau
which is used for calculation. (A geometric procedure which is practical
for the two-dimensional case is sketched in problem 1 of Problem
Set XII.)


THE SIMPLEX METHOD 


The first step in the simplex method for solving the general problem
posed at the beginning of this section is to convert the system of m
linear inequalities (8.3) to a system of m linear equations. This is done
by the insertion of the so-called “slack variables” $x_{k+1}, x_{k+2}, \dots, x_{k+m}$,
one for each of the displayed inequalities so that (8.3) becomes

(8.9)
$$\begin{aligned}
a_{11} x_1 + a_{12} x_2 + \cdots + a_{1k} x_k + x_{k+1} &= b_1\\
a_{21} x_1 + a_{22} x_2 + \cdots + a_{2k} x_k + x_{k+2} &= b_2\\
&\ \,\vdots\\
a_{m1} x_1 + a_{m2} x_2 + \cdots + a_{mk} x_k + x_{k+m} &= b_m
\end{aligned}$$

The linear programming problem may now be restated as follows:
to find, among all sets of points ($x_1, x_2, \dots, x_{k+m}$) satisfying (8.9) with
each $x_j$ non-negative, that point ($x_1, x_2, \dots, x_{k+m}$) which maximizes the
linear function

(8.10)
$$Z = C_1 x_1 + C_2 x_2 + \cdots + C_k x_k$$

Once the maximal ($x_1, x_2, \dots, x_{k+m}$) is found, the answer to the
problem as originally stated is just the subset ($x_1, x_2, \dots, x_k$). The
slack variables are merely a device used in arriving at the optimal
solution, and may or may not have physical meaning in individual
problems.

The key theorem underlying the simplex approach says, in effect,
that the set ($x_1, x_2, \dots, x_{k+m}$) which maximizes (8.10) subject to (8.9)
must necessarily possess k elements which are zero. If we knew which
coordinates were zero at the optimum, we could insert their values in
(8.9); the latter system would then be a set of m equations in m
unknowns, and could be solved algebraically for the optimum. Without
this knowledge, we must be content to proceed in a step-by-step fashion
toward a solution. The procedure may be visualized as follows. We
choose k of the k+m coordinates, and assign them the value zero.
Using these values, we then solve the restriction equations for the
remaining m coordinates. If any of the coordinates are negative, we
discard this solution as violating the non-negativity restriction, and
try another set of k coordinates for the zero variables. If none of the
coordinates are negative, we have what is called a feasible solution,
which is the starting point of the iterative process. The simplex
technique permits us to test the feasible solution for optimality, and, if it is
non-optimal, to proceed directly to a different feasible solution with an
improved Z. It can be shown that eventually the technique must take
us to an optimal solution.

A feasible solution for the nth stage is related to the feasible solution
for the (n+1)st stage in the following way. One of the zero coordinates
for the nth stage becomes non-zero in the (n+1)st stage, and is called an
entering variable. To compensate, one of the non-zero coordinates in the
nth stage becomes zero in the (n+1)st stage, and is called a departing
variable. The other zero coordinates remain zero; and the other
non-zero coordinates in general remain non-zero (though their values may
change).

In what follows, we shall use repeatedly the concept of a matrix 
representation of a system of linear equations, and shall also assume 
that the reader is acquainted with the procedure for manipulating rows 
of the matrix until a solution to the original system may be read off. 
The reader who is not familiar with these topics is advised to read 
Appendix III before proceeding further with this chapter. 

The simplex routine is presented next. For definiteness, we shall give 
the tableau for the case of five original variables and four restrictions 
[k = 5, m = 4 in (8.9) and (8.10) above]. 


Step One
Convert the m restrictive inequalities to m equations, by the insertion
of the non-negative slack variables $x_{k+1}, x_{k+2}, \dots, x_{k+m}$, one to each of
the m restrictions. Write this system of equations in matrix form.
Mark each column in some way so as to designate which variable is
served by that column of coefficients. (Below, we mark the column of
coefficients of $x_j$ by $P_j$, and use $P_0$ to designate the column of constant
terms.) Insert also a row $C_j$, where $C_j$ is the coefficient of $x_j$ in the
objective function Z.


             P1   P2   P3   P4   P5   P6   P7   P8   P9   P0

             a11  a12  a13  a14  a15   1    0    0    0   b1
             a21  a22  a23  a24  a25   0    1    0    0   b2
             a31  a32  a33  a34  a35   0    0    1    0   b3
             a41  a42  a43  a44  a45   0    0    0    1   b4

    C_j      C1   C2   C3   C4   C5    0    0    0    0


Step Two
Read off a feasible solution, and insert it as a row below the
coefficient matrix. The feasible solution at any stage is read off by choosing
the non-zero variables to be those whose columns consist of one 1 and the
rest zeros. Insert P-designations $P_{i\alpha}$ to the left of each row of the
coefficient matrix, to indicate which non-zero coordinate value is read
from that equation. Insert also a column of $C_{i\alpha}$, where $C_{i\alpha}$ is the
coefficient of $x_{i\alpha}$ in Z.

If all the $b_i$ are positive, a feasible solution is obviously $x_1 = x_2 =
x_3 = x_4 = x_5 = 0$, $x_6 = b_1$, $x_7 = b_2$, $x_8 = b_3$, $x_9 = b_4$. The tableau
becomes:


    C_iα  P_iα      P1   P2   P3   P4   P5   P6   P7   P8   P9   P0

     0    P6        a11  a12  a13  a14  a15   1    0    0    0   b1
     0    P7        a21  a22  a23  a24  a25   0    1    0    0   b2
     0    P8        a31  a32  a33  a34  a35   0    0    1    0   b3
     0    P9        a41  a42  a43  a44  a45   0    0    0    1   b4

          C_j       C1   C2   C3   C4   C5    0    0    0    0
          Solution   0    0    0    0    0   b1   b2   b3   b4


(If not all the $b_i$ are positive, some preliminary manipulation is
necessary before the feasible solution can be read off. See Example 11,
page 234.)


Step Three

Test the solution for optimality. This is done by computing an
“evaluation” $\Delta_j$ for each zero variable $x_j$, by the formula

(8.11)    $\Delta_j = C_j - \sum_{i=1}^{m} a_{ij} C_{id}$

If one or more of the evaluations $\Delta_j$ are positive, the solution is
non-optimal, and we proceed to step four. If $\Delta_j \le 0$ for each j, the solution
is optimal. (Note: If none of the evaluations is positive, but if any are
zero, other optimal solutions exist, with the same value for Z. If all
the evaluations are less than zero, the achieved optimal solution is
unique.)
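The test in step three can be sketched in a few lines of Python. This is only an illustration of formula (8.11), not part of the routine as printed: the function names `evaluations` and `is_optimal`, the zero-based column numbering, and the small numerical system used for the demonstration are all our own assumptions.

```python
from fractions import Fraction


def evaluations(A, c, basis):
    """Return {j: Delta_j} for every zero (non-basic) variable, per (8.11).

    A     -- coefficient matrix, one row per restriction (slack columns included)
    c     -- coefficients C_j of the objective function, one per column
    basis -- basis[i] is the column whose variable is read from row i
    """
    m = len(A)
    deltas = {}
    for j in range(len(c)):
        if j in basis:
            continue  # non-zero variables receive no evaluation
        deltas[j] = Fraction(c[j]) - sum(
            Fraction(A[i][j]) * c[basis[i]] for i in range(m)
        )
    return deltas


def is_optimal(deltas):
    """The solution is optimal when no evaluation is positive."""
    return all(d <= 0 for d in deltas.values())


# Illustrative data (an assumption): three restrictions, three original
# variables, three slack variables occupying columns 3, 4, 5.
A = [[2, 3, 0, 1, 0, 0],
     [0, 2, 5, 0, 1, 0],
     [3, 2, 4, 0, 0, 1]]
c = [3, 5, 4, 0, 0, 0]
basis = [3, 4, 5]
deltas = evaluations(A, c, basis)
```

When only slack variables are non-zero, every $C_{id}$ is zero, so each evaluation simply equals the corresponding $C_j$.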

The interpretation of the evaluations is as follows. Let $Z_0$ be the
value of Z for the feasible solution under test, and let $Z_p$ be the value
of Z that would result if the feasible solution were altered as follows:
increase the zero coordinate $x_j$ from 0 to 1, retain the other zero
coordinates as zero, and adjust the non-zero coordinates so that the restriction
equations remain satisfied. Then

(8.12)    $\Delta_j = Z_p - Z_0$

so that a positive evaluation signals a possible increase in Z.
The proof of (8.12) is left to the reader as an exercise.

The tableau has now grown to:

C_id  P_id    P_1   P_2   P_3   P_4   P_5   P_6  P_7  P_8  P_9   P_0
 0    P_6     a_11  a_12  a_13  a_14  a_15   1    0    0    0    b_1
 0    P_7     a_21  a_22  a_23  a_24  a_25   0    1    0    0    b_2
 0    P_8     a_31  a_32  a_33  a_34  a_35   0    0    1    0    b_3
 0    P_9     a_41  a_42  a_43  a_44  a_45   0    0    0    1    b_4

C_j           C_1   C_2   C_3   C_4   C_5    0    0    0    0
Solution       0     0     0     0     0    b_1  b_2  b_3  b_4
Δ_j           Δ_1   Δ_2   Δ_3   Δ_4   Δ_5    -    -    -    -


(At this stage, and indeed at any stage of a problem where the set of
non-zero variables is just the set of slack variables, it is seen that
$\Delta_j = C_j$ for each j. But such a relationship will not hold for later
iterations.)


Step Four

Determine the entering variable $x_r$. Recall that the entering variable
is that zero variable in the feasible solution which is to become non-zero
in the next iteration. An entering variable $x_j$ must have the following
properties:

(i) It is zero in the present tableau.
(ii) $\Delta_j \ge 0$.
(iii) At least one of the $a_{ij}$ in the column for j must be greater than
zero.*

Any variable $x_j$ with these properties could be used, in the sense that
the resulting feasible solution in the next iteration will produce a value
of Z at least as great as that in the present tableau. The following
empirical rule for choosing the entering variable has been found to
produce quick convergence. Choose from among the possible entering
variables that zero coordinate $x_j$ for which $\Delta_j$ is largest.


* Any variable $x_j$ which has no element $a_{ij}$ in its column greater than zero may
be increased indefinitely without violating the original restrictions. If such a
variable also has a positive $\Delta_j$, Z may be increased indefinitely merely by increasing
$x_j$. Such cases can hardly arise in models of physical problems.


Step Five

Determine the departing variable $x_s$. The departing variable is that
non-zero coordinate which is to become zero in the next iteration.
The choice of departing variable is, in general, determined by the choice
of the entering variable, together with the non-negativity restrictions
on the coordinates. If $x_r$ is the entering variable, the rule is: s will be
that i for which the quotient $b_i/a_{ir}$ takes its smallest non-negative
value; and $x_s$, the non-zero variable read from that row, will be the
departing variable. (Note: Since we assumed
earlier that the problem has been put into such a form that all the $b_i$ are
positive, we are really selecting from among positive $a_{ir}$ in the rth
column.)
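The rule of step five amounts to a minimum-ratio search over the rows with a positive element in the entering column. A minimal sketch follows; the name `departing_row` and the zero-based indexing are assumptions made for the illustration, and ties are resolved in favor of the first row found, which the text does not prescribe.

```python
from fractions import Fraction


def departing_row(A, b, r):
    """Return the row index s minimizing b_i / a_ir over rows with a_ir > 0.

    Returns None when no element of column r is positive, i.e. when the
    entering variable could be increased indefinitely.
    """
    best = None
    for i, row in enumerate(A):
        if row[r] > 0:
            ratio = Fraction(b[i]) / Fraction(row[r])
            if best is None or ratio < best[0]:
                best = (ratio, i)
    return None if best is None else best[1]


# Illustrative data (an assumption): column 1 is the entering column.
A = [[2, 3, 0],
     [0, 2, 5],
     [3, 2, 4]]
b = [8, 10, 15]
s = departing_row(A, b, 1)   # ratios are 8/3, 5 and 15/2
```

Choosing the smallest quotient guarantees that the new constant column stays non-negative after the row operations of the next step.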

We now have all the information we need to complete the simplex
tableau for this stage of the iteration. For definiteness, let the entering
variable be $x_3$ and the departing variable be $x_7$.

C_id  P_id    P_1   P_2   P_3    P_4   P_5   P_6  P_7  P_8  P_9   P_0
 0    P_6     a_11  a_12  a_13   a_14  a_15   1    0    0    0    b_1
 0    P_7     a_21  a_22 [a_23]  a_24  a_25   0    1    0    0    b_2
 0    P_8     a_31  a_32  a_33   a_34  a_35   0    0    1    0    b_3
 0    P_9     a_41  a_42  a_43   a_44  a_45   0    0    0    1    b_4

C_j           C_1   C_2   C_3    C_4   C_5    0    0    0    0
Solution       0     0     0      0     0    b_1  b_2  b_3  b_4
Δ_j           Δ_1   Δ_2   Δ_3    Δ_4   Δ_5    -    -    -    -

Entering variable: $x_3$ (column $P_3$). Departing variable: $x_7$ (row $P_7$).

The reason for distinguishing the element in the column labeled $P_3$
and the row labeled $P_7$ (shown in brackets) will become apparent in the next step.


Step Six 

Calculate the new coefficient matrix by means of suitably chosen 
row operations on the old coefficient matrix. The new coefficient 
matrix is to have unity in the position distinguished in step five, and 
zero elsewhere in that column. When this is accomplished, the new 
matrix will be such that the new feasible solution can be read off 
immediately. 

Computationally, we proceed as follows. Divide all elements of the 
distinguished row by the distinguished element of that row, thus 
obtaining unity in the distinguished position. Then subtract appropri- 
ate multiples of this new row from the other m—1 rows, so as to obtain 
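zeros in the remaining positions of the distinguished column.

If we use $a'_{ij}$, $b'_i$ to denote the elements of the new coefficient matrix,
the new matrix will be:

    P_1    P_2    P_3  P_4    P_5    P_6  P_7    P_8  P_9   P_0
    a'_11  a'_12   0   a'_14  a'_15   1   a'_17   0    0    b'_1
    a'_21  a'_22   1   a'_24  a'_25   0   a'_27   0    0    b'_2
    a'_31  a'_32   0   a'_34  a'_35   0   a'_37   1    0    b'_3
    a'_41  a'_42   0   a'_44  a'_45   0   a'_47   0    1    b'_4

The $b'_i$ will all be non-negative, because of the way the entering and
departing variables were chosen.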
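The row operations of step six can be written out as a short routine. The sketch below is an illustration only: the name `pivot`, the convention that each row carries its constant term as the last element, and the numerical data are assumptions, and exact fractions are used so that no rounding enters.

```python
from fractions import Fraction


def pivot(T, s, r):
    """Return a new matrix with unity at (s, r) and zeros elsewhere in column r.

    T -- rows of the form [a_i1, ..., a_in, b_i]
    s -- index of the distinguished row (departing variable)
    r -- index of the distinguished column (entering variable)
    """
    T = [[Fraction(v) for v in row] for row in T]
    p = T[s][r]                              # the distinguished element
    T[s] = [v / p for v in T[s]]             # unity in the distinguished position
    for i in range(len(T)):
        if i != s:
            f = T[i][r]
            # subtract the multiple of the new row that clears column r
            T[i] = [v - f * w for v, w in zip(T[i], T[s])]
    return T


# Illustrative data (an assumption): pivot on row 0, column 1.
T0 = [[2, 3, 0, 1, 0, 0, 8],
      [0, 2, 5, 0, 1, 0, 10],
      [3, 2, 4, 0, 0, 1, 15]]
T1 = pivot(T0, 0, 1)
```

After the call, column 1 of `T1` consists of a single 1 and the rest zeros, so the new feasible solution can be read off immediately from the last column.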


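Step Seven

Repeat steps one through six as needed, until an optimum is finally
attained in step three.
The steps are performed, of course, on the new coefficient matrix,
so that for example (8.11) becomes

    $\Delta'_j = C_j - \sum_{i=1}^{m} a'_{ij} C'_{id}$

and so on. The complete second tableau, before entering and departing
variables are chosen, is given below.

C'_id P'_id   P_1    P_2    P_3  P_4    P_5    P_6  P_7    P_8  P_9   P_0
 0    P_6     a'_11  a'_12   0   a'_14  a'_15   1   a'_17   0    0    b'_1
 C_3  P_3     a'_21  a'_22   1   a'_24  a'_25   0   a'_27   0    0    b'_2
 0    P_8     a'_31  a'_32   0   a'_34  a'_35   0   a'_37   1    0    b'_3
 0    P_9     a'_41  a'_42   0   a'_44  a'_45   0   a'_47   0    1    b'_4

C_j           C_1    C_2    C_3  C_4    C_5     0    0      0    0
Solution       0      0     b'_2  0      0     b'_1  0     b'_3 b'_4
Δ'_j          Δ'_1   Δ'_2    -   Δ'_4   Δ'_5    -   Δ'_7    -    -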


Example 10

Rework Example 9 (the geometric example) by the simplex method.

SOLUTION
To agree with the notation of the simplex routine presented above,
we rename the variables x, y, z as $x_1, x_2, x_3$, and call the slack variables
$x_4, x_5, x_6$. Then the problem becomes one of maximizing

    $Z = 3x_1 + 5x_2 + 4x_3$

subject to the restrictions

    $2x_1 + 3x_2 + x_4 = 8$
    $2x_2 + 5x_3 + x_5 = 10$
    $3x_1 + 2x_2 + 4x_3 + x_6 = 15$


The first tableau becomes

C_id  P_id      P_1   P_2   P_3   P_4   P_5   P_6     P_0
 0    P_4        2    [3]    0     1     0     0       8
 0    P_5        0     2     5     0     1     0      10
 0    P_6        3     2     4     0     0     1      15

C_j              3     5     4     0     0     0
Solution         0     0     0     8    10    15     (Z = 0)
Δ_j              3     5     4     -     -     -
b_i/a_i2         -     -     -    8/3    5   15/2

Entering variable:* $x_2$. Departing variable:* $x_4$.

* Recall that the entering variable $x_r$ is determined by the largest $\Delta_j$, and
the departing variable $x_s$ is determined by the smallest $b_i/a_{ir}$.

To obtain the second tableau, we first calculate the intermediate
coefficient matrix

    2/3    1    0   1/3    0    0     8/3
     0     2    5    0     1    0     10
     3     2    4    0     0    1     15

and by row operations we get the coefficient matrix for the second
tableau:

    2/3    1    0   1/3    0    0     8/3
   -4/3    0    5  -2/3    1    0    14/3
    5/3    0    4  -2/3    0    1    29/3


In this example, it turns out that four tableaus are needed before 
an optimum is reached. The second, third, and fourth are obtained 
in the manner described, and are set down below. 


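SECOND TABLEAU

C_id  P_id      P_1    P_2   P_3   P_4    P_5    P_6     P_0
 5    P_2       2/3     1     0    1/3     0      0      8/3
 0    P_5      -4/3     0    [5]  -2/3     1      0     14/3
 0    P_6       5/3     0     4   -2/3     0      1     29/3

C_j              3      5     4     0      0      0
Solution         0     8/3    0     0    14/3   29/3    (Z = 40/3)
Δ_j            -1/3     -     4   -5/3     -      -
b_i/a_i3         -      -     -     -    14/15  29/12

Entering variable: $x_3$. Departing variable: $x_5$.


THIRD TABLEAU

C_id  P_id      P_1     P_2   P_3    P_4     P_5    P_6     P_0
 5    P_2       2/3      1     0     1/3      0      0      8/3
 4    P_3      -4/15     0     1    -2/15    1/5     0     14/15
 0    P_6     [41/15]    0     0    -2/15   -4/5     1     89/15

C_j              3       5     4      0       0      0
Solution         0      8/3  14/15    0       0    89/15   (Z = 256/15)
Δ_j            11/15     -     -   -17/15   -4/5     -
b_i/a_i1         -       4     -      -       -    89/41

Entering variable: $x_1$. Departing variable: $x_6$.


FOURTH TABLEAU

C_id  P_id      P_1    P_2    P_3    P_4     P_5     P_6     P_0
 5    P_2        0      1      0    15/41    8/41  -10/41   50/41
 4    P_3        0      0      1    -6/41    5/41    4/41   62/41
 3    P_1        1      0      0    -2/41  -12/41   15/41   89/41

C_j              3      5      4      0       0       0
Solution       89/41  50/41  62/41    0       0       0     (Z = 765/41)
Δ_j              -      -      -   -45/41  -24/41  -11/41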


Since all the $\Delta_j$ are non-positive, the fourth tableau yields an optimum.
The optimal solution is (89/41, 50/41, 62/41), giving a Z of 765/41.
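The whole routine, steps one through seven, can be checked against this example with a compact sketch in Python. It is an illustration under stated assumptions rather than a definitive implementation: the name `simplex_max` is ours, all $b_i$ are taken to be non-negative, the entering variable is chosen by the largest evaluation, and nothing is done to guard against the cycling that can occur in degenerate problems.

```python
from fractions import Fraction


def simplex_max(A, b, c):
    """Maximize c.x subject to A x <= b and x >= 0, assuming every b_i >= 0."""
    m, k = len(A), len(c)
    # Step one: one slack column per restriction, constant term last.
    T = [[Fraction(v) for v in A[i]]
         + [Fraction(int(i == j)) for j in range(m)]
         + [Fraction(b[i])] for i in range(m)]
    cost = [Fraction(v) for v in c] + [Fraction(0)] * m
    basis = list(range(k, k + m))        # step two: the slacks are non-zero
    while True:
        # Step three: evaluations of the zero variables, formula (8.11).
        delta = {j: cost[j] - sum(T[i][j] * cost[basis[i]] for i in range(m))
                 for j in range(k + m) if j not in basis}
        r = max(delta, key=delta.get)    # step four: largest evaluation
        if delta[r] <= 0:
            break                        # no positive evaluation: optimum
        # Step five: smallest quotient b_i / a_ir among positive a_ir.
        rows = [i for i in range(m) if T[i][r] > 0]
        if not rows:
            raise ValueError("Z can be increased indefinitely")
        s = min(rows, key=lambda i: T[i][-1] / T[i][r])
        # Step six: row operations on the distinguished element.
        p = T[s][r]
        T[s] = [v / p for v in T[s]]
        for i in range(m):
            if i != s:
                f = T[i][r]
                T[i] = [v - f * w for v, w in zip(T[i], T[s])]
        basis[s] = r
    x = [Fraction(0)] * (k + m)
    for i, j in enumerate(basis):
        x[j] = T[i][-1]
    z = sum(cost[j] * x[j] for j in range(k + m))
    return x[:k], z


x, z = simplex_max([[2, 3, 0], [0, 2, 5], [3, 2, 4]], [8, 10, 15], [3, 5, 4])
```

With exact fractions the result can be compared directly with the solution read from the fourth tableau.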


It has already been pointed out that the routine given above cannot
even start if some of the constant terms $b_i$ are negative. The reason is
that, if a $b_i$ is negative, the solution read off from the original coefficient
matrix contains the negative coordinate $x_{k+i}$ $(= b_i)$, and is therefore not
feasible. Once a feasible solution is obtained, by ruse or inspiration,
the simplex routine can proceed as given.

The following example illustrates how a feasible solution can be 
obtained when some of the b; are negative. 


Example 11

Obtain a first feasible solution in the case where some linear function
$Z(x_1, x_2, x_3)$ is to be maximized subject to the restrictions

(8.13)    $x_1 + x_2 + x_3 \le 8$
          $-x_1 - 2x_2 + x_3 \le -2$


SOLUTION

If we merely introduced the slack variables $x_4$ and $x_5$, so that the
restrictions became

    $x_1 + x_2 + x_3 + x_4 = 8$
    $-x_1 - 2x_2 + x_3 + x_5 = -2$

the solution read off would be (0, 0, 0, 8, -2), which is not feasible.
But, if we also introduce a so-called “artificial variable” $x_6$, understood
to be positive and not greater than $x_5$, the restrictions (8.13) can
be rewritten as

    $x_1 + x_2 + x_3 + x_4 = 8$
    $-x_1 - 2x_2 + x_3 + x_5 - x_6 = -2$

or better

    $x_1 + x_2 + x_3 + x_4 = 8$
    $x_1 + 2x_2 - x_3 - x_5 + x_6 = 2$

The first tableau becomes

P_id      P_1   P_2   P_3   P_4   P_5   P_6     P_0
P_4        1     1     1     1     0     0       8
P_6        1    [2]   -1     0    -1     1       2

Solution   0     0     0     8     0     2

The solution consists of non-negative coordinates, but is not yet feasible,
since $x_6 > x_5$, and the second restriction in (8.13) is violated.

We now transform the equations as in the simplex routine, using the
artificial variable $x_6$ as the departing variable. We choose as an entering
variable one of the original variables $x_1, x_2, x_3$, being careful to
choose one that will yield a transformed matrix with a non-negative
constant column. In this example, we may choose $x_2$ if we wish as the
entering variable. The next tableau becomes

P_id      P_1   P_2   P_3   P_4   P_5   P_6     P_0
P_4       1/2    0    3/2    1    1/2  -1/2      7
P_2       1/2    1   -1/2    0   -1/2   1/2      1

Solution   0     1     0     7     0     0

The solution (0, 1, 0, 7, 0, 0) is feasible, since it consists of non-negative
coordinates which satisfy the restrictions (8.13). The artificial
variable $x_6$ has served its purpose, and we can return to the 2-by-5
problem. To maximize $Z(x_1, x_2, x_3)$, we would start with the
tableau:

P_id      P_1   P_2   P_3   P_4   P_5     P_0
P_4       1/2    0    3/2    1    1/2      7
P_2       1/2    1   -1/2    0   -1/2      1

Solution   0     1     0     7     0

(If there were more than one negative $b_i$, a corresponding number of
artificial variables would be introduced. Then the artificial variables
would be removed one at a time and replaced by the original and slack
variables.)

Since the arithmetic of the simplex method is fairly lengthy, it may
be advisable to perform certain checks on a tableau before proceeding
to the next tableau. One obvious check is that the Z value obtained
for a tableau should not be less than the Z value for the previous
tableau. A more precise check relies on the fact (not proved here) that
the row of evaluations $\Delta'_j$ may be obtained from the $\Delta_j$ row of the
previous tableau, in the same way that the rows of the coefficient matrix
are obtained from the previous coefficient matrix. That is, to obtain
the new row of evaluations we subtract from the old evaluation row
the product of $\Delta_r$ times the distinguished row divided by the
distinguished element of that row. [This procedure could have been used to
compute the new $\Delta_j$; then the check would consist of the application
of equation (8.11).]

The discussion of the text has been confined to problems where Z is
to be maximized subject to restrictions, each requiring a linear function
of the $x_j$ to be less than or equal to some constant. However, this is no
real limitation. If Z is to be minimized, we merely regard the problem
as one of maximizing (-Z). Further, any restriction of the form

    $\sum_j k_j x_j \ge b_i$

can be rewritten as

    $\sum_j (-k_j) x_j \le -b_i$


APPLICATIONS OF LINEAR PROGRAMMING 


Although the arithmetic technique used in linear programming 
problems is long and tedious, it is essentially a mechanical repetition 
of a sequence of precisely defined steps. The real task for the operations 
research worker is to recognize that his problem can be solved by linear 
programming and to formulate a model that will lead to a useful
solution. In this section we will give some illustrative examples of problems
that can be solved with the simplex technique. 


Example 12 

A company makes two kinds of leather belts. Belt A is a high-quality
belt, and belt B is of lower quality. The respective profits are
$0.40 and $0.30 per belt. Each belt of type A requires twice as much
time as a belt of type B, and, if all belts were of type B, the company
could make 1000 per day. The supply of leather is sufficient for only
800 belts per day (both A and B combined). Belt A requires a fancy
buckle, and only 400 per day are available. There are only 700 buckles
a day available for belt B. Set up the linear programming equations
for the problem.


SOLUTION
Suppose the company makes $x_1$ belts of type A and $x_2$ belts of type B
each day.
From the restrictions on the number of buckles available we have

    $x_1 \le 400$
    $x_2 \le 700$

From the availability of leather we have

    $x_1 + x_2 \le 800$

and from the limitation on time

    $2x_1 + x_2 \le 1000$

Introducing the slack variables $x_3$, $x_4$, $x_5$, and $x_6$, we can write the
restrictions as

    $x_1 + x_3 = 400$
    $x_2 + x_4 = 700$
    $x_1 + x_2 + x_5 = 800$
    $2x_1 + x_2 + x_6 = 1000$

The profit will be

    $Z = 0.40x_1 + 0.30x_2$

and we wish to maximize Z over non-negative values of $x_1, x_2, \ldots, x_6$
subject to the above restrictions.
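Although the example asks only for the formulation, a problem in two variables is small enough to check by examining the corners of the feasible region, as in the geometric method. The sketch below is our own illustration, not the simplex routine: the names `restrictions`, `vertices`, and `profit` are assumptions, and the non-negativity conditions are written as two further inequalities.

```python
from fractions import Fraction
from itertools import combinations

# Each restriction (a1, a2, b) stands for a1*x1 + a2*x2 <= b.
restrictions = [
    (1, 0, 400),     # buckles for belt A
    (0, 1, 700),     # buckles for belt B
    (1, 1, 800),     # leather
    (2, 1, 1000),    # time
    (-1, 0, 0),      # x1 >= 0
    (0, -1, 0),      # x2 >= 0
]


def feasible(x1, x2):
    return all(a1 * x1 + a2 * x2 <= b for a1, a2, b in restrictions)


def vertices():
    """Intersect every pair of boundary lines and keep the feasible points."""
    points = set()
    for (a1, a2, b), (c1, c2, d) in combinations(restrictions, 2):
        det = a1 * c2 - a2 * c1
        if det == 0:
            continue                      # parallel lines never meet
        x1 = Fraction(b * c2 - a2 * d, det)
        x2 = Fraction(a1 * d - b * c1, det)
        if feasible(x1, x2):
            points.add((x1, x2))
    return points


def profit(point):
    return Fraction(40, 100) * point[0] + Fraction(30, 100) * point[1]


best = max(vertices(), key=profit)
```

Enumerating corners is practical only for very small problems; it serves here as an independent check on the formulation.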


Example 13

A blender of whiskey imports three grades A, B, and C. He mixes
them according to recipes which specify the maximum or minimum
percentages of grades A and C in each blend. These are shown in Table 2.

TABLE 2. SPECIFICATIONS OF BLENDS

Blend             Specification               Price per fifth
Blue Dot          Not less than 60% of A      $6.80
                  Not more than 20% of C
Highland Fling    Not more than 60% of C      $5.70
                  Not less than 15% of A
Old Frenzy        Not more than 50% of C      $4.50

Supplies of the three basic whiskeys together with their costs are
shown in Table 3.

TABLE 3. AVAILABILITY AND COSTS OF INGREDIENTS

Whiskey    Maximum quantity available, fifths per day    Cost per fifth
A          2000                                          $7.00
B          2500                                           5.00
C          1200                                           4.00


Show how to obtain the first matrix in a simplex computation of a
production policy that will maximize profits.


SoLuTION 

It is clear that, if there were no limitations on supplies, the blender 
would use the maximum quantities of the cheaper of the whiskeys that 
the specifications allow. In Old Frenzy he would use 50% C and 50% 
B. The average cost per fifth of the mixture would be $4.50, and at 
this price there would be no profit. Any other blend meeting the speci- 
fication would result in a loss, and we may conclude that Old Frenzy 
will not be made. We can thus reduce the problem to the blends Blue 
Dot and Highland Fling only. 

We will use the following notation.

    $x_{11}$ = quantity of A used for Blue Dot
    $x_{12}$ = quantity of B used for Blue Dot
    $x_{13}$ = quantity of C used for Blue Dot
    $x_{21}$ = quantity of A used for Highland Fling
    $x_{22}$ = quantity of B used for Highland Fling
    $x_{23}$ = quantity of C used for Highland Fling

An amount $x_{11} + x_{12} + x_{13}$ of Blue Dot will be produced and sold for
$6.80(x_{11} + x_{12} + x_{13})$. An amount $x_{21} + x_{22} + x_{23}$ of Highland Fling
will be produced and sold for $5.70(x_{21} + x_{22} + x_{23})$. The total amounts
of each ingredient used will be $(x_{11} + x_{21})$ of A, $(x_{12} + x_{22})$ of B, and
$(x_{13} + x_{23})$ of C.
Hence the total profit Z is given by

    $Z = 6.80(x_{11} + x_{12} + x_{13}) + 5.70(x_{21} + x_{22} + x_{23})$
        $- 7.00(x_{11} + x_{21}) - 5.00(x_{12} + x_{22}) - 4.00(x_{13} + x_{23})$

So $Z = -0.20x_{11} + 1.80x_{12} + 2.80x_{13} - 1.30x_{21} + 0.70x_{22} + 1.70x_{23}$.
Thus we wish to maximize Z, subject to the limitations imposed by
the specifications and the availability.
From the availability,

(8.14)    $x_{11} + x_{21} \le 2000$
(8.15)    $x_{12} + x_{22} \le 2500$
(8.16)    $x_{13} + x_{23} \le 1200$

From the specifications for Blue Dot,

(8.17)    $x_{11} \ge 0.60(x_{11} + x_{12} + x_{13})$   or   $3x_{12} + 3x_{13} - 2x_{11} \le 0$
(8.18)    $x_{13} \le 0.20(x_{11} + x_{12} + x_{13})$   or   $4x_{13} - x_{11} - x_{12} \le 0$

From the specifications of Highland Fling,

(8.19)    $x_{23} \le 0.60(x_{21} + x_{22} + x_{23})$   or   $2x_{23} - 3x_{21} - 3x_{22} \le 0$
(8.20)    $x_{21} \ge 0.15(x_{21} + x_{22} + x_{23})$   or   $3x_{22} + 3x_{23} - 17x_{21} \le 0$
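The coefficients of Z and the homogeneous form of a specification restriction are easy to check mechanically. The following sketch is only an illustration; the dictionary names and the two helper functions are our own assumptions. It computes the margin of each ingredient in each blend, the cost of the cheapest admissible Old Frenzy, and compares the two forms of (8.17) on a grid of small quantities.

```python
from fractions import Fraction

price = {"Blue Dot": Fraction("6.80"), "Highland Fling": Fraction("5.70")}
cost = {"A": Fraction("7.00"), "B": Fraction("5.00"), "C": Fraction("4.00")}

# Profit per fifth of ingredient g when it is sold as part of a given blend.
coeff = {(blend, g): price[blend] - cost[g] for blend in price for g in cost}

# Cheapest Old Frenzy allowed by its specification: 50% C and 50% B.
old_frenzy_cost = (cost["B"] + cost["C"]) / 2


def blue_dot_a_spec(x11, x12, x13):
    """First form of (8.17): at least 60% of the blend is A."""
    return x11 >= Fraction(60, 100) * (x11 + x12 + x13)


def blue_dot_a_linear(x11, x12, x13):
    """Second form of (8.17), with the variables collected on one side."""
    return 3 * x12 + 3 * x13 - 2 * x11 <= 0


forms_agree = all(
    blue_dot_a_spec(a, b, c) == blue_dot_a_linear(a, b, c)
    for a in range(6) for b in range(6) for c in range(6)
)
```

The negative margins on A reflect the fact that A costs more per fifth than either blend sells for, so A is used only as far as the specifications require.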


We now introduce the slack variables $x_1, \ldots, x_7$ into the inequalities
(8.14) through (8.20) and obtain the first tableau given below.

C_j                0    0    0    0    0    0    0   -0.20  1.80  2.80  -1.30  0.70  1.70
P_id     P_0      P_1  P_2  P_3  P_4  P_5  P_6  P_7   P_11  P_12  P_13   P_21  P_22  P_23
P_1      2000      1    0    0    0    0    0    0     1     0     0      1     0     0
P_2      2500      0    1    0    0    0    0    0     0     1     0      0     1     0
P_3      1200      0    0    1    0    0    0    0     0     0     1      0     0     1
P_4         0      0    0    0    1    0    0    0    -2     3     3      0     0     0
P_5         0      0    0    0    0    1    0    0    -1    -1     4      0     0     0
P_6         0      0    0    0    0    0    1    0     0     0     0     -3    -3     2
P_7         0      0    0    0    0    0    0    1     0     0     0    -17     3     3

Solution         2000 2500 1200   0    0    0    0     0     0     0      0     0     0
Δ_j                -    -    -    -    -    -    -   -0.20  1.80  2.80  -1.30  0.70  1.70


Example 14

A steel mill produces three types of coils, each made of a different 
alloy. The process flow chart looks like Figure 8.3. The problem is to 
determine the amounts of each alloy to produce, within the limitations 
of sales and machine capacities, so as to maximize profits. 


Figure 8.3. Flow chart for Example 14. 


Data on capacities and profits are given in Tables 4 and 5.

TABLE 4

Machine          No. of machines    8-hour shifts per week    Down time, %
Box anneal             4                    21                     5
Strand anneal          1                    20                    10
Tandem roll            1                    12                     0

TABLE 5

Alloy   Operation          Machine rate          Sales potential        Profit per ton
1       Box anneal         28 hr per 10 tons     1250 tons per month    $25
        Tandem roll (1)    50 ft per min
        Strand anneal      20 ft per min
        Tandem roll (2)    25 ft per min
2       Box anneal         35 hr per 10 tons     250 tons per month     $35
        Strand anneal      20 ft per min
        Tandem roll        25 ft per min
3       Strand anneal      16 ft per min         1500 tons per month    $40
        Tandem roll        20 ft per min


Coils for each alloy are 400 feet long and weigh 4 tons. Set up the 
objective function and the restrictions, from which a simplex solution 
to the manufacturer’s problem might be obtained. 


SOLUTION
Let $x_i$ (i = 1, 2, 3) be the number of tons of alloy i produced each
month. We wish to maximize

    $C = 25x_1 + 35x_2 + 40x_3$

subject to limitations imposed by the available machine capacity and
sales potential.
We first compute the machine capacity in hours per month (Table 6).

TABLE 6

Machine          No. of machines   Shifts per week   % useful time   Capacity, hr per month
Box anneal             4                 21               95          4 × 21 × 8 × 4⅓ × 0.95 = 2766.2
Strand anneal          1                 20               90          1 × 20 × 8 × 4⅓ × 0.90 = 624
Tandem roll            1                 12              100          1 × 12 × 8 × 4⅓ = 416


We next translate the various machine rates into tons per hour
(Table 7).

TABLE 7

Alloy   Operation          Machine rate
1       Box anneal         28 hr per 10 tons = 0.357 ton per hr
        Tandem roll (1)    50 ft/min = (50 × 60 × 4)/400 = 30 tons per hr
        Strand anneal      20 ft/min = 12 tons per hr
        Tandem roll (2)    25 ft/min = 15 tons per hr
2       Box anneal         35 hr per 10 tons = 0.286 ton per hr
        Strand anneal      20 ft/min = 12 tons per hr
        Tandem roll        25 ft/min = 15 tons per hr
3       Strand anneal      16 ft/min = 9.6 tons per hr
        Tandem roll        20 ft/min = 12 tons per hr
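The conversion behind Table 7 rests on the statement that a 400-foot coil weighs 4 tons. A short sketch of the arithmetic follows; the names `TONS_PER_FOOT` and `tons_per_hour` are our own, introduced only for the illustration.

```python
from fractions import Fraction

TONS_PER_FOOT = Fraction(4, 400)      # a coil is 400 feet long and weighs 4 tons


def tons_per_hour(feet_per_minute):
    """Convert a line speed in feet per minute into tons per hour."""
    return feet_per_minute * 60 * TONS_PER_FOOT


rates = {fpm: tons_per_hour(fpm) for fpm in (50, 20, 25, 16)}

# Box anneal rates are quoted directly in hours per 10 tons.
box_rate_alloy1 = Fraction(10, 28)    # tons per hour, alloy 1
box_rate_alloy2 = Fraction(10, 35)    # tons per hour, alloy 2
```

The reciprocals of these rates are the machine hours needed per ton, which is the form in which they enter the capacity restrictions.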


We now compute the machine times required for $x_1, x_2, x_3$ tons of
alloy 1, 2, 3, and use these machine times for a formal statement of the
capacity restrictions:

For box anneal:

(8.21)    $\frac{x_1}{0.357} + \frac{x_2}{0.286} \le 2766.2$

For tandem roll:

(8.22)    $\frac{x_1}{30} + \frac{x_1}{15} + \frac{x_2}{15} + \frac{x_3}{12} \le 416$

For strand anneal:

(8.23)    $\frac{x_1}{12} + \frac{x_2}{12} + \frac{x_3}{9.6} \le 624$

In addition we have the limitations imposed by the sales potential.

(8.24)    $x_1 \le 1250$
(8.25)    $x_2 \le 250$
(8.26)    $x_3 \le 1500$

Examination of the restrictions (8.21) through (8.26) shows that (8.24)
is redundant, since, even if $x_2 = 0$, the restriction (8.21) implies that

    $x_1 \le 987.5 < 1250$

Thus we wish to maximize

    $C = 25x_1 + 35x_2 + 40x_3$

subject to the restrictions (8.21), (8.22), (8.23), (8.25), (8.26).


Example 15 

The strategic bomber command receives instructions to interrupt the 
enemy’s tank production. The enemy has four key plants located in 
separate cities, and destruction of any one plant will effectively halt 
the production of tanks. There is an acute shortage of fuel, which 
limits the supply to 48,000 gallons for this particular mission. Any 
bomber sent to any particular city must have at least enough fuel for 
the round trip plus a reserve of 100 gallons. 

The number of bombers available to the commander and their
descriptions are listed in the following table.

Bomber type    Description    Miles per gallon    Number available
1              Heavy          2                   48
2              Medium         3                   32

Information about the location of the plants and their vulnerability
to attack by a medium bomber and a heavy bomber is given below.

         Distance from     Probability of destruction by
Plant    base, miles       a heavy bomber    a medium bomber
1        450               0.10              0.08
2        480               0.20              0.16
3        540               0.15              0.12
4        600               0.25              0.20


How many of each type of bomber should be dispatched, and how 
should they be allocated among the four targets, in order to maximize 
the probability of success? (Assume that no damage is inflicted on a 
plant by a bomber that fails to destroy it.) 


SOLUTION
Let $x_{ij}$ be the number of bombers of type i sent to city j:

    i = 1, 2;  j = 1, 2, 3, 4

We wish to maximize the probability of destroying at least one plant,
and this is equivalent to minimizing the probability of not destroying
any plant. We will use Q to denote this probability. So

    $Q = (1 - 0.10)^{x_{11}} (1 - 0.20)^{x_{12}} (1 - 0.15)^{x_{13}} (1 - 0.25)^{x_{14}}$
        $\times (1 - 0.08)^{x_{21}} (1 - 0.16)^{x_{22}} (1 - 0.12)^{x_{23}} (1 - 0.20)^{x_{24}}$

We now have to impose the restrictions on the $x_{ij}$ due to availability
of fuel and aircraft.

Fuel

    $\frac{2 \times 450}{2} x_{11} + \frac{2 \times 480}{2} x_{12} + \frac{2 \times 540}{2} x_{13} + \frac{2 \times 600}{2} x_{14}$
    $+ \frac{2 \times 450}{3} x_{21} + \frac{2 \times 480}{3} x_{22} + \frac{2 \times 540}{3} x_{23} + \frac{2 \times 600}{3} x_{24}$
    $+ 100(x_{11} + x_{12} + x_{13} + x_{14} + x_{21} + x_{22} + x_{23} + x_{24}) \le 48{,}000$

Thus

(8.27)    $550x_{11} + 580x_{12} + 640x_{13} + 700x_{14} + 400x_{21}$
          $+ 420x_{22} + 460x_{23} + 500x_{24} \le 48{,}000$

Aircraft

(8.28)    $x_{11} + x_{12} + x_{13} + x_{14} \le 48$
(8.29)    $x_{21} + x_{22} + x_{23} + x_{24} \le 32$

Now, although the restrictions on the $x_{ij}$ are linear, the function Q is
not linear. However, minimizing Q is equivalent to minimizing log Q,
and log Q is linear in the $x_{ij}$. Moreover minimizing log Q is equivalent
to maximizing -log Q = log (1/Q). Any convenient base of logarithms
will do, and we will use base 10, as $\log_{10}$ is readily available in
tables. Thus we wish to maximize

    $\log (1/Q) = 0.04576x_{11} + 0.09691x_{12} + 0.07058x_{13} + 0.12494x_{14}$
               $+ 0.03621x_{21} + 0.07572x_{22} + 0.05552x_{23} + 0.09691x_{24}$

subject to the restrictions (8.27) through (8.29).
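The coefficients of the linearized objective and of the fuel restriction follow directly from the two tables of data, and can be recomputed as a check. In the sketch below the dictionary names, the function `q_failure`, and the sample allocation are assumptions made for the illustration; computed logarithms may differ in the last decimal places from figures taken from printed tables.

```python
import math

# Probability that one bomber of type i destroys plant j.
P_DESTROY = {
    (1, 1): 0.10, (1, 2): 0.20, (1, 3): 0.15, (1, 4): 0.25,
    (2, 1): 0.08, (2, 2): 0.16, (2, 3): 0.12, (2, 4): 0.20,
}
MILES = {1: 450, 2: 480, 3: 540, 4: 600}    # distance of plant j from base
MPG = {1: 2, 2: 3}                          # miles per gallon of bomber type i


def q_failure(x):
    """Probability that no plant is destroyed, for an allocation x[(i, j)]."""
    q = 1.0
    for key, n in x.items():
        q *= (1 - P_DESTROY[key]) ** n
    return q


# Coefficients of log10(1/Q): one term -log10(1 - p) per bomber sent.
objective = {key: -math.log10(1 - p) for key, p in P_DESTROY.items()}

# Gallons per bomber: round trip plus the 100-gallon reserve.
fuel = {(i, j): 2 * MILES[j] / MPG[i] + 100 for i in MPG for j in MILES}

x = {(1, 4): 10, (2, 2): 5}                 # hypothetical allocation
linear_value = sum(objective[key] * n for key, n in x.items())
```

For any allocation, `linear_value` agrees with `-log10(q_failure(x))` up to rounding, which is the property that makes the problem linear.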


Example 16 
The sales forecasts for a certain product, by months, are given below. 


January 2,000 July 10,000 
February 3,000 August 6,000 
March 4,000 September 4,000 
April 6,000 October 3,000 
May 8,000 November 2,000 
June 10,000 December 2,000 


It costs $1.00 per unit to increase production from one month to the
next, and $0.50 per unit to decrease production. Production scheduled
for the month of December in the current year is 2000 units, and it is 
estimated that the inventory level on January 1 will be 1000 units. 
Storage capacity is limited to 5000 units at any one time. 

Show how to obtain a production schedule for the coming year that 
will minimize the cost of changing production rates, while at the same 
time insuring that sufficient stock will be available to meet the sales 
forecast at all times. (Assume that production scheduled during a 
month becomes available for shipment just in time to meet the current 
month’s sales demand.) 


SOLUTION

Let $q_i$ denote the production scheduled for the ith month, where
i = 0 for this December, i = 1 for January, and so on. Then, if
$q_i > q_{i+1}$, the change in production rate will cost $0.50(q_i - q_{i+1})$, and,
if $q_{i+1} > q_i$, the change will cost $1.00(q_{i+1} - q_i)$.
We introduce new variables $u_i$, $v_i$ defined by

(8.30)    $u_i = \begin{cases} q_i - q_{i+1} & \text{if } q_i > q_{i+1} \\ 0 & \text{if } q_i \le q_{i+1} \end{cases}$

          $v_i = \begin{cases} 0 & \text{if } q_i > q_{i+1} \\ q_{i+1} - q_i & \text{if } q_i \le q_{i+1} \end{cases}$

For any production schedule $q_1, q_2, \ldots, q_{12}$, the series $u_i$ and $v_i$ are each
uniquely defined, are non-negative variables, and have the further
property that

    $q_i - q_{i+1} = u_i - v_i$

for all i.
A given production schedule $q_1, q_2, \ldots, q_{12}$ will involve relevant costs
equal to

(8.31)    $\sum_{i=0}^{11} (0.50u_i + 1.00v_i) = \sum_{i=0}^{11} \left( \frac{u_i}{2} + v_i \right)$
Since all sales forecasts are to be met, the sum of the initial inventory
and the cumulated production must at all times exceed the cumulated
sales. If we use $S_i$ to denote sales in month i, this gives the restrictions

(8.32)    $1000 + \sum_{j=1}^{i} q_j \ge \sum_{j=1}^{i} S_j$,    i = 1, 2, ..., 12

We also have the restriction that inventory at any time cannot exceed
5000:

(8.33)    $1000 + \sum_{j=1}^{i} q_j - \sum_{j=1}^{i} S_j \le 5000$,    i = 1, 2, ..., 12
We next express the restrictions (8.32) and (8.33) in terms of the new
variables $u_i$ and $v_i$. Since

    $q_i = q_0 + \sum_{j=0}^{i-1} (q_{j+1} - q_j)$

we have

(8.34)    $q_i = q_0 - \sum_{j=0}^{i-1} (u_j - v_j)$

Thus

    $\sum_{j=1}^{i} q_j = i q_0 - \sum_{j=1}^{i} \sum_{k=0}^{j-1} (u_k - v_k)$

The latter equality reduces to

(8.35)    $\sum_{j=1}^{i} q_j = i q_0 - \sum_{j=0}^{i-1} (i - j)(u_j - v_j)$

and the restrictions (8.32) and (8.33) become

(8.36)    $1000 + i q_0 - \sum_{j=0}^{i-1} (i - j)(u_j - v_j) \ge \sum_{j=1}^{i} S_j$

          $i q_0 - \sum_{j=0}^{i-1} (i - j)(u_j - v_j) - \sum_{j=1}^{i} S_j \le 4000$

Since i ranges from 1 to 12, there are 24 restrictions in all included in
(8.36). Twelve more restrictions arise from the fact that each $q_i \ge 0$;
from (8.34), the statement in terms of the new variables is

(8.37)    $\sum_{j=0}^{i-1} (u_j - v_j) \le q_0$
We can now state the scheduling problem as a linear programming
problem. Given [see (8.31)] the linear form

    $C = \sum_{i=0}^{11} \left( \frac{u_i}{2} + v_i \right)$

find, among all non-negative $u_i$ and $v_i$ satisfying the 36 restrictions
(8.36) and (8.37), that set $(u_0, \ldots, u_{11}, v_0, \ldots, v_{11})$ which minimizes C.

In order for the optimal solution so obtained to be of any practical
use, we must use the relation (8.34) to go from the $u_i$ and $v_i$ to the
corresponding $q_i$. But (8.34) is based on the assumption that at least one
of each pair $(u_i, v_i)$ is zero. Apparently this is a defect in the solution,
but actually it is not. In all 36 restrictions, the coefficient of any $u_i$
is minus the coefficient of the corresponding $v_i$; and there is a theorem
in linear programming which implies that, in such a situation, $u_i$ and $v_i$
(fixed i) cannot both be non-zero in a feasible solution.
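The change of variables can be tried out on a concrete schedule. The sketch below is an illustration only: the function names and the trial schedule `q_chase`, which simply produces each month's forecast sales, are our own assumptions and are not claimed to be optimal. It forms the $u_i$ and $v_i$ of (8.30), prices the schedule by (8.31), tests the inventory restrictions, and evaluates the right-hand side of (8.35).

```python
def split_changes(q):
    """Given q = [q_0, q_1, ..., q_12], return the lists (u, v) of (8.30)."""
    u = [max(q[i] - q[i + 1], 0) for i in range(len(q) - 1)]
    v = [max(q[i + 1] - q[i], 0) for i in range(len(q) - 1)]
    return u, v


def change_cost(u, v):
    """Cost of changing production rates, formula (8.31)."""
    return sum(0.50 * ui + 1.00 * vi for ui, vi in zip(u, v))


def feasible(q, sales, start_inventory=1000, capacity=5000):
    """Check (8.32) and (8.33): inventory stays between 0 and capacity."""
    inventory = start_inventory
    for qi, si in zip(q[1:], sales):
        inventory += qi - si
        if inventory < 0 or inventory > capacity:
            return False
    return True


def cumulative_production(q0, u, v, i):
    """Right-hand side of (8.35): q_1 + ... + q_i in terms of u and v."""
    return i * q0 - sum((i - j) * (u[j] - v[j]) for j in range(i))


sales = [2000, 3000, 4000, 6000, 8000, 10000,
         10000, 6000, 4000, 3000, 2000, 2000]
q_chase = [2000] + sales          # hypothetical schedule, December first
u, v = split_changes(q_chase)
cost = change_cost(u, v)
```

Because at most one of each pair $(u_i, v_i)$ is non-zero by construction, the schedule can be recovered from them through (8.34).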


PROBLEMS FOR SOLUTION 


Set XII 


1. Find values of x and y that maximize
Z = 2x + 5y


Work in two ways. [Ans. x = y = 300.]

[Hint: Draw the region defined by the restrictions in the xy plane. Superpose
on this region the lines of the family 2x + 5y = Z, where Z is regarded as a
parameter. The line furthest from the origin which still intersects the region
will determine the optimal solution (x, y). A second method of solution is
the algebraic simplex routine.]


2. Use the simplex method to maximize Z = 5x - 2y + 3z, where x, y, z
are non-negative variables subject to the restrictions

    $2x + 2y - z \ge 2$
    $3x - 4y \le 3$
    $y + 3z \le 5$

[Hint: Use the method of Example 11 to obtain a first feasible solution.]
3. Use the simplex method to maximize
Z = 6x + 5y - 3z - 4w
where x, y, z, w are non-negative variables subject to the following restrictions:
On + By + 22 — 4w = 24 
x+2y < 10 
rt yt%t3wS ld 
y+ 2+ ws 8 


[Hint: Since one of the restrictions is in the form of an equality, some
modification of the methods given thus far is needed. The temptation in such
problems is to use the equality to eliminate one variable (say x) from the
problem and solve for the optimum in terms of the remaining variables y, z,
w. The flaw in this procedure is that, at the resulting optimum, x may turn
out to be negative. The suggested method is to write the simplex array with
four rows, introducing only three slack variables (one for each inequality), and
treat one of the original variables (say x) as if it were a slack variable. If we
choose x, we divide the first row by 2 and subtract suitable multiples of the
result from the remaining rows, so as to yield an array in which x has a
coefficient of unity in the first row and zero in the others. There is now a negative
element (-2) in the $P_0$ column, second row. In order to obtain a feasible
solution, use the methods of Example 11 of the text.]

4. A ship has three cargo holds, forward, aft, and center. The capacity
limits are:

    Forward    2000 tons    100,000 cubic feet
    Center     3000 tons    135,000 cubic feet
    Aft        1500 tons     30,000 cubic feet

The following cargoes are offered; the shipowners may accept all or any part
of each commodity:

    Commodity    Amount, tons    Volume per ton, cu ft    Profit per ton, $
    A            6,000           60                       6
    B            4,000           50                       8
    C            2,000           25                       5

In order to preserve the trim of the ship, the weight in each hold must be
proportional to the capacity in tons. How should the cargo be distributed
so as to maximize profit? (Note: Since the restrictions will contain an equality,
the approach suggested in the previous problem may be needed in order to
obtain a numerical answer.)


5. A trucking company with $400,000 to spend on new equipment is
contemplating three types of vehicle. Vehicle A has a 10-ton payload and is expected
to average 35 miles per hour. It costs $8000. Vehicle B has a 20-ton payload
and is expected to average 30 miles per hour. It costs $13,000. Vehicle C is 
a modified form of B; it carries sleeping quarters for one driver, and this: 


ceed 30. How many vehicles of each type should be purchased if the company 
wishes to maximize its capacity in ton-miles per day? 


6. A plant makes two products, A and B, which are routed through four
processing centers, 1, 2, 3, 4, as shown by the solid lines in Figure 8.4.


Figure 8.4. Routing of products A and B in problem 6. Each center can handle
only one pass of one product at a time. [Figure label: alternative routing.]


there is spare capacity in center 3, it is possible to route product A through 
3 instead of going through 2 twice, but this is more expensive. 

Given the information below, how should production be scheduled so as to 
maximize profits? [By a “production schedule” is meant the specification of 
the following three amounts: (1) the daily amount of raw material used for A, 
regular route, (2) the daily amount of raw material used for A, optional route, 
(3) the daily amount of raw material used for B. Assume that sufficient storage
capacity is available at no additional cost.] 


TABLE 8

Product   Center          Input, gal per hr   % recovery   Running cost per hr, $
A         1               300                 90           150
          2 (1st pass)    450                 95           200
          4               250                 85           180
          2 (2nd pass)    400                 80           220
          3               350                 75           250
B         1               500                 90           300
          3               480                 85           250
          4               400                 80           240

          Raw material    Sales price per   Maximum daily sales,
Product   cost per gal    finished gal      gal of finished product
A         5               20                1700
B         6               18                1500


Centers 1 and 4 run up to 16 hours a day; centers 2 and 3 run up to 12 hours
a day. A final restriction is furnished by shipping facilities, which limit the
daily output of A and B to a total of 2500 gallons.


CHAPTER NINE


Sequencing


In sequencing we are concerned with a situation where the effective- 
ness measure is a function of the order or sequence in which a series of 
tasks are performed. Problems fall into two broad groups. In the 
first group we have n tasks to perform, each of which requires processing 
on some or all of m different machines. The effectiveness for any given 
sequence of the tasks at each machine can be measured, and we wish to
select from the $(n!)^m$ theoretically possible sequences:


(a) those that are technologically feasible: i.e. those which satisfy the 
restrictions (if any) on the order in which each task must be processed 
through the m machines. 

(b) one (or several) of the technologically feasible sequences which 
optimizes the effectiveness measure. 


In theory, a solution by enumeration is always possible, but, in 
practice, the computation of effectiveness for a given sequence can be 
quite involved, and the number of cases for enumeration makes this 
approach prohibitive even for moderate values of m and n. 

[If m = 5 and n = 5, there are $(5!)^5 \approx 25{,}000{,}000{,}000$ cases to
enumerate.]
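The count grows so quickly that even small shops are out of reach of enumeration. A short sketch makes the point; the function name `sequence_count` is our own.

```python
import math


def sequence_count(n, m):
    """Number of theoretically possible sequences: (n!) ** m."""
    return math.factorial(n) ** m


five_by_five = sequence_count(5, 5)
```

For n = m = 5 the exact figure is 24,883,200,000, which the text rounds to 25,000,000,000.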

In the second type of problem, we have a job shop with a number of
machines and a list of tasks to perform. Each time a machine
completes the task on which it is engaged we have to decide on the next task
to be started. One of the characteristics of the situation is that the list
of tasks will change as fresh orders are received.

Unfortunately both types of problem seem to be intrinsically difficult, 
and at present solutions are known only for some special cases of the 
first type. At the time of writing, there appears to be no mathematical 
theory for the second type of problem, but some empirical rules have 
been tested with moderate success by simulation on computers. 


PROCESSING EACH OF n JOBS THROUGH 
m MACHINES 


There are n jobs (1, 2, ..., n), each of which has to be processed one 
at a time at each of m machine centers (A, B, ...). The order of 
processing each job through the machine centers is given (for example, 
job 1 is processed in centers A, C, B in this order). We are also given 
the time that each job must spend at each machine center. The prob- 
lem is to find a sequence for processing the jobs so that the total elapsed 
time for all the jobs will be a minimum. 

Symbolically, let 


$A_i$ = time for job i on machine A 
$B_i$ = time for job i on machine B; etc. 
T = time from start of first job to completion of the last job 


We wish to determine for each machine a sequence $(i_1, i_2, \ldots, i_n)$, 
where $(i_1, i_2, \ldots, i_n)$ is a permutation of the integers $(1, 2, \ldots, n)$, 
which will minimize T. 

Satisfactory solutions are available at present only for three special 
cases: 


(1) n jobs and two machines A and B; all jobs processed in the order 
AB. 

(2) n jobs and three machines A, B, and C; all jobs processed in the 
order ABC; other limitations as described below. 

(3) two jobs and m machines; each job to be processed through the 
machines in a prescribed order which is not necessarily the same for 
both jobs. 


These three cases are described in more detail in later subsections, 
which also give methods of solution and some examples. 
Example 1 
Suppose that we have four jobs 1, 2, 3, 4 which must be processed 
through machines A, B, C, D in the following order. 


Job 1: ABCD 
Job 2: ACBD 
Job 3: BCDA 
Job 4: BCDA 


Each individual task (processing one job on one machine) takes one 
hour. 

(a) Show that the following job sequence, selected from among the 
$(4!)^4$ possible job sequences, is not feasible. 


Machine A: 1342 
Machine B: 3412 
Machine C: 1423 
Machine D: 4321 


(b) Find a feasible job sequence, and compute the corresponding 
elapsed time T from the start of the first job started to the completion 
of the last job finished. 


SOLUTION 

We first set down the two arrays, one representing the prescribed 
machine order for each job, and the other representing the suggested 
job order in each machine. 


An individual task is feasible at any particular instant provided the 
machine is idle, the job is idle, the machine is next for the job, and the 
job is next for the machine. Thus, at time zero, the tasks (A, 1) and 
(B, 3) are feasible; we indicate the completion of these two operations 
by circles about each machine and job, as below. 


Now (B, 4) is feasible; and we continue circling completed operations 
until we reach the following situation. 


At this point we can proceed no further, and conclude that the job se- 
quence under test is not feasible. 

We can modify the given sequence in such a way that it becomes 
feasible, by starting with the completed portion of the tasks: 


and then filling in the blank boxes so as to be compatible with the 
prescribed machine order array. A resulting feasible sequence is 


Machine A: 1342 
Machine B: 3412 
Machine C: 1432 
Machine D: 4132 


To compute the elapsed time for this feasible job sequence, we recall 
that each individual task is assumed to require one hour, and proceed 
to construct a time-table. 


Hour   Tasks 
1      (A, 1)  (B, 3) 
2      (B, 4) 
3      (B, 1) 
4      (C, 1) 
5      (C, 4) 
6      (D, 4) 
7      (C, 3)  (D, 1) 
8      (D, 3) 
9      (A, 3) 
10     (A, 4) 
11     (A, 2) 
12     (C, 2) 
13     (B, 2) 
14     (D, 2) 


The elapsed time is 14 hours. This answer can also be obtained graphically, 
by means of a Gantt chart of the sort illustrated on page 262. 

Note that we have not really solved the sequencing problem of this 
example; all we have done is to find a feasible sequence and compute 
the corresponding T. No mathematical method (besides enumeration) 
is available at present for finding the optimal sequence in problems of 
this dimensionality. 
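
The feasibility test and the time-table of Example 1 can be mechanized. The following is a minimal sketch under the example's assumption that every task takes one hour; the function name `elapsed_hours` and the dictionary layout are our own choices, not part of the text. Each hour it starts every task for which the job is next on the machine and the machine is next for the job, and it reports a deadlock when no task can start.

```python
def elapsed_hours(job_routes, machine_sequences):
    """Elapsed hours for unit-time tasks, or None if the sequence is not feasible.

    job_routes:        {job: "ABCD"}        prescribed machine order per job
    machine_sequences: {machine: [jobs]}    proposed job order on each machine
    """
    next_step = {job: 0 for job in job_routes}            # progress of each job
    next_slot = {mach: 0 for mach in machine_sequences}   # progress of each machine
    total_tasks = sum(len(route) for route in job_routes.values())
    done = hours = 0
    while done < total_tasks:
        ready = []
        for mach, seq in machine_sequences.items():
            if next_slot[mach] < len(seq):
                job = seq[next_slot[mach]]
                route = job_routes[job]
                # Feasible now: job is next for machine, machine is next for job.
                if next_step[job] < len(route) and route[next_step[job]] == mach:
                    ready.append((mach, job))
        if not ready:
            return None                  # nothing can start: deadlock
        for mach, job in ready:          # all ready tasks run during this hour
            next_slot[mach] += 1
            next_step[job] += 1
        done += len(ready)
        hours += 1
    return hours


routes = {1: "ABCD", 2: "ACBD", 3: "BCDA", 4: "BCDA"}
proposed = {"A": [1, 3, 4, 2], "B": [3, 4, 1, 2], "C": [1, 4, 2, 3], "D": [4, 3, 2, 1]}
modified = {"A": [1, 3, 4, 2], "B": [3, 4, 1, 2], "C": [1, 4, 3, 2], "D": [4, 1, 3, 2]}
print(elapsed_hours(routes, proposed))  # None (not feasible)
print(elapsed_hours(routes, modified))  # 14
```

Because the sketch starts each task at the earliest possible hour, an individual task may be placed an hour earlier than in a hand-built time-table, but the elapsed time for the feasible sequence is still 14 hours.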


Processing n Jobs through Two Machines 


This sequencing problem, for which a solution is available, is more 
completely described as follows: 


(a) Only two machines are involved, A and B. 

(b) Each job is processed in the order AB. 

(c) The exact or expected processing times $A_1, A_2, \ldots, A_n$, 
$B_1, B_2, \ldots, B_n$ are known. 

The problem is to minimize T, the elapsed time from the start of the 
first job to the completion of the last job. 

It can be shown that the sequences that minimize T are the same 
for both machines. The method of computation, given here without 
proof, is due to S. M. Johnson.* 


1. Select the smallest processing time occurring in the list $A_1, \ldots, A_n$, 
$B_1, \ldots, B_n$. If there is a tie, select either smallest processing time. 

2. If the minimum processing time is $A_r$, do the rth job first. If it is 
$B_s$, do the sth job last. Note that this decision will apply to both 
machines A and B. 

3. There are now n−1 jobs left to be ordered. Apply steps 1 and 2 
to the reduced set of processing times obtained by deleting the two 
machine processing times corresponding to the job already assigned. 
(In the reduced problem, "first" and "last" refer to the earliest and 
latest positions not yet filled.) 

4. Continue in this manner until all jobs have been ordered. The 
resulting ordering will minimize T. 
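
The four steps above can be sketched in a few lines of Python. This is an illustrative implementation only: the names `johnson_order` and `elapsed_time`, and the choice of a dictionary mapping each job to its pair of times $(A_i, B_i)$, are our assumptions. Ties are broken by whichever job is met first, which the method permits. The data are those of Example 2 below.

```python
def johnson_order(times):
    """times: {job: (A_i, B_i)}. Returns a job order that minimizes T."""
    remaining = dict(times)
    front, back = [], []          # jobs fixed at the start / at the end
    while remaining:
        # Step 1: smallest processing time among all remaining A_i and B_i.
        job, (a, b) = min(remaining.items(), key=lambda kv: min(kv[1]))
        if a <= b:
            front.append(job)     # Step 2: minimum is an A-time, do the job first
        else:
            back.insert(0, job)   # Step 2: minimum is a B-time, do the job last
        del remaining[job]        # Step 3: continue with the reduced set
    return front + back


def elapsed_time(order, times):
    """Elapsed time T when the jobs pass through A and then B in the given order."""
    out_a = out_b = 0
    for job in order:
        a, b = times[job]
        out_a += a                       # machine A is never idle until it finishes
        out_b = max(out_b, out_a) + b    # B waits for the job and for its own release
    return out_b


example_2 = {1: (5, 2), 2: (1, 6), 3: (9, 7), 4: (3, 8), 5: (10, 4)}
order = johnson_order(example_2)
print(order, elapsed_time(order, example_2))  # [2, 4, 3, 5, 1] 30
```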


Example 2 


We have five jobs, each of which must go through the two machines 
A and B in the order AB. Processing times are given in the table below: 


PROCESSING TIME, HR 


Job Machine A Machine B 
1 5 2 
2 1 6 
3 9 7 
4 3 8 
5 10 4 


Determine a sequence for the five jobs that will minimize the elapsed 
time T. 


SOLUTION 
Applying the routine of the text, we see that the smallest processing 
time is 1 hour for job 2 on machine A. Thus we schedule job 2 first: 


| 2 |   |   |   |   | 


*S. M. Johnson, “Optimal Two- and Three-Stage Production Schedules with 
Setup Times Included,” Naval Research Logistics Quarterly, Vol. 1, no. 1 (Mar. 1954). 
The reduced set of processing times is 


Job   A    B 
1     5    2 
3     9    7 
4     3    8 
5    10    4 


The smallest processing time, 2, is $B_1$. So we schedule job 1 last: 


| 2 |   |   |   | 1 | 


Continuing, we have 


Job   A    B 
3     9    7 
4     3    8       leading to   | 2 | 4 |   |   | 1 | 
5    10    4 


Job   A    B 
3     9    7       leading to   | 2 | 4 |   | 5 | 1 | 
5    10    4 


so that the optimal sequence is   | 2 | 4 | 3 | 5 | 1 | 


We may calculate the elapsed time corresponding to the optimal or- 
dering, using the individual processing times given in the statement of 
the problem. The details are given in the following table. 


          Machine A              Machine B 
Job   Time in   Time out    Time in   Time out 
2        0         1           1         7 
4        1         4           7        15 
3        4        13          15        22 
5       13        23          23        27 
1       23        28          28        30 


Thus the minimum elapsed time is 30 hours. Idle time is 3 hours for 
machine B, and 2 hours for machine A. 


Processing n Jobs through Three Machines 


No solution is available at present for the general sequencing problem 
of n jobs, three machines A, B, and C, prescribed order ABC for each 
job, and no passing. However, the method of the previous subsection 
can be extended to cover the special cases where either or both of the 
following conditions hold: 

(a) The smallest processing time for machine A is at least as great 
as the largest processing time for machine B. 

(b) The smallest processing time for machine C is at least as great 
as the largest processing time for machine B. 

The method, stated here without proof, is to replace the problem 
with an equivalent problem involving n jobs and two machines. We 
denote the fictitious machines by G and H, and define the corresponding 
processing times $G_i$ and $H_i$ by 

$$G_i = A_i + B_i$$
$$H_i = B_i + C_i$$

We work the new problem, with prescribed ordering GH, by the method 
of the last subsection. The resulting optimal sequence will also be 
optimal for the original problem. 
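
The reduction to two fictitious machines can be sketched as follows. The function names and data layout are our own assumptions; the two-machine routine is repeated so that the block stands on its own, and the data are those of Example 3 below. The sketch refuses to proceed when neither condition (a) nor condition (b) holds, since the method is then not justified.

```python
def johnson_order(times):
    """Two-machine rule. times: {job: (first_machine_time, second_machine_time)}."""
    remaining = dict(times)
    front, back = [], []
    while remaining:
        job, (g, h) = min(remaining.items(), key=lambda kv: min(kv[1]))
        if g <= h:
            front.append(job)
        else:
            back.insert(0, job)
        del remaining[job]
    return front + back


def three_machine_order(times):
    """times: {job: (A_i, B_i, C_i)}. Valid only under condition (a) or (b)."""
    min_a = min(a for a, b, c in times.values())
    max_b = max(b for a, b, c in times.values())
    min_c = min(c for a, b, c in times.values())
    if min_a < max_b and min_c < max_b:
        raise ValueError("neither condition (a) nor condition (b) holds")
    # Fictitious machines: G_i = A_i + B_i, H_i = B_i + C_i.
    reduced = {job: (a + b, b + c) for job, (a, b, c) in times.items()}
    return johnson_order(reduced)


def elapsed_time_abc(order, times):
    """Elapsed time T for jobs passing through A, B, C in the given order."""
    out_a = out_b = out_c = 0
    for job in order:
        a, b, c = times[job]
        out_a += a
        out_b = max(out_b, out_a) + b
        out_c = max(out_c, out_b) + c
    return out_c


example_3 = {1: (4, 5, 8), 2: (9, 6, 10), 3: (8, 2, 6), 4: (6, 3, 7), 5: (5, 4, 11)}
order = three_machine_order(example_3)
print(order, elapsed_time_abc(order, example_3))  # [1, 4, 5, 2, 3] 51
```

With a different tie-breaking choice the routine would return another of the equally good orderings; all of them give the same elapsed time.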


Example 3 
We have five jobs, each of which must go through the machines A, 
B, and C in the order ABC. Processing times are: 


Job   A    B    C 
1     4    5    8 
2     9    6   10 
3     8    2    6 
4     6    3    7 
5     5    4   11 


Determine a sequence for the five jobs that will minimize the elapsed 
time T. 


SOLUTION 

Here min $A_i$ = 4, max $B_i$ = 6, min $C_i$ = 6. Since max $B_i \le$ min $C_i$, 
we are justified in applying the method of this subsection. The equivalent 
problem becomes 


PROCESSING TIMES 
Job   G    H 
1     9   13 
2    15   16 
3    10    8 
4     9   10 
5     9   15 


Because of ties, there are several optimal orderings. They are 


| 1 | 4 | 5 | 2 | 3 | 
| 1 | 5 | 4 | 2 | 3 | 
| 4 | 1 | 5 | 2 | 3 | 
| 4 | 5 | 1 | 2 | 3 | 
| 5 | 1 | 4 | 2 | 3 | 
| 5 | 4 | 1 | 2 | 3 | 


Any of these orderings may be used to sequence the jobs through 
machines A, B, and C; and they will all yield a minimum elapsed time 
of 51 hours. 


Processing Two Jobs through m Machines 


In this subsection, we consider the following situation: 

(a) There are m machines, denoted by A, B, C, ..., K. 

(b) Only two jobs are to be performed: job 1 and job 2. 

(c) The technological ordering of each of the two jobs through the 
m machines is prescribed in advance. This technological ordering need 
not be the same for both jobs. 

(d) The exact or expected processing times $A_1, B_1, \ldots, K_1$, 
$A_2, B_2, \ldots, K_2$ are known. 


The problem again is to minimize T, the elapsed time from the start of 
the first job to the completion of the last job. 

The method will be presented by means of an example. Let us 
suppose that we have two jobs which are to be processed through four 
machines, A, B, C, and D. The prescribed technological orderings are: 


Job 1: ABCD 
Job 2: DBAC 


Processing times are given in the following table. 
Machine A   Machine B   Machine C   Machine D 


It is obvious that at time zero we should put either job 1 on machine 
A or job 2 on machine D. It is not so obvious, however, that we should 
start both jobs together. Something might be gained by starting one 
job first and keeping the other machine idle for a time. The same sort 
of problem will recur at later stages of the schedule: should we start a 
job through its next process as soon as the machine becomes available, 
or should we wait? With a little thought the reader can see that the 
sequencing problem is to determine, for each machine, whether the jobs 
shall be processed in the order 1, 2, or 2,1. Such a set of m decisions 
will be called a program. The first step in the solution is to prepare a 
table showing all possible programs. [There are $2^m$ possible programs; 
here, with m = 4, there are 16 programs.] 

To abbreviate the notation for the programs, we resort to a symbolic 
device. Let 


$a$ = the decision {job 1 before job 2 on machine A} 
$\bar{a}$ = the decision {job 2 before job 1 on machine A} 


and similarly for the other machines. With this notation, we obtain the 
following table displaying the 16 possible programs for the case of two 
jobs and four machines. 


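Program No. 

$$
\begin{array}{cccccccccccccccc}
1 & 2 & 3 & 4 & 5 & 6 & 7 & 8 & 9 & 10 & 11 & 12 & 13 & 14 & 15 & 16 \\
\hline
\bar{a} & a & \bar{a} & a & \bar{a} & a & \bar{a} & a & \bar{a} & a & \bar{a} & a & \bar{a} & a & \bar{a} & a \\
\bar{b} & \bar{b} & b & b & \bar{b} & \bar{b} & b & b & \bar{b} & \bar{b} & b & b & \bar{b} & \bar{b} & b & b \\
\bar{c} & \bar{c} & \bar{c} & \bar{c} & c & c & c & c & \bar{c} & \bar{c} & \bar{c} & \bar{c} & c & c & c & c \\
\bar{d} & \bar{d} & \bar{d} & \bar{d} & \bar{d} & \bar{d} & \bar{d} & \bar{d} & d & d & d & d & d & d & d & d
\end{array}
$$

[Such a table can be generated by filling the first $2^{k-1}$ spaces on the 
kth row with barred letters, and then completing the row with alternate 
blocks of $2^{k-1}$ letters without bars and with bars. For example, row 
3 starts with $2^{3-1} = 4$ $\bar{c}$'s; then 4 $c$'s; etc.] Note that the programs 
give the ordering of the two jobs on a given machine, but say nothing 
about the (technological) ordering of the machines for a given job. 
Thus $\bar{a}\bar{b}\bar{c}\bar{d}$ (program 1) merely says “do job 2 on each machine before 
job 1,” and could just as well be written as $\bar{b}\bar{a}\bar{c}\bar{d}$ or $\bar{b}\bar{c}\bar{d}\bar{a}$, etc. 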

The next step is to delete all programs that are not technologically 
feasible, in that they are not consistent with the two prescribed tech- 
nological orderings. 


Rule 

Suppose that machine X must precede machine Y for job 1, and 
machine Y must precede machine X for job 2. Then no program that 
contains both the decision $\bar{x}$ and the decision $y$ is technologically 
feasible. 


The rule is easily proved. Technologically, we must have the following 
time relationships between events. 


[1 on X] precedes [1 on Y] 
[2 on Y] precedes [2 on X] 

A program containing the decisions $\bar{x}$ and $y$ says, in part: 

[2 on X] precedes [1 on X] 
[1 on Y] precedes [2 on Y] 

The first, second, and fourth statements imply 

[1 on X] precedes [2 on X] 

which violates the third statement. 
In applying this rule, we first identify all machine pairs that appear 
in reverse order for the two jobs. In the example, these pairs are 


AB AD BD CD 


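where the given order of a pair is as for job 1. By the rule, all programs 
containing $\bar{a}b$, $\bar{a}d$, $\bar{b}d$, or $\bar{c}d$ are not technologically feasible. Thus we 
eliminate programs 3, 7, 9, 10, 11, 12, 13, 14, 15 and are left with 

Program No. 

$$
\begin{array}{ccccccc}
1 & 2 & 4 & 5 & 6 & 8 & 16 \\
\hline
\bar{a} & a & a & \bar{a} & a & a & a \\
\bar{b} & \bar{b} & b & \bar{b} & \bar{b} & b & b \\
\bar{c} & \bar{c} & \bar{c} & c & c & c & c \\
\bar{d} & \bar{d} & \bar{d} & \bar{d} & \bar{d} & \bar{d} & d
\end{array}
$$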
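
The construction of the program table and the feasibility rule can be checked mechanically. In the sketch below (function name and encoding are our own), program number p corresponds to the binary digits of p − 1, a 1 in position k meaning an unbarred letter for the kth machine; this reproduces the block pattern described for the table. A program is rejected when, for some reversed pair X, Y, it has job 2 first on X and job 1 first on Y.

```python
def feasible_programs(route1, route2):
    """Numbers (1-based) of the technologically feasible programs.

    route1, route2: machine orders for job 1 and job 2, e.g. "ABCD" and "DBAC".
    """
    machines = sorted(route1)
    pos1 = {m: i for i, m in enumerate(route1)}
    pos2 = {m: i for i, m in enumerate(route2)}
    # Pairs (X, Y): X before Y for job 1, but Y before X for job 2.
    reversed_pairs = [(x, y) for x in machines for y in machines
                      if pos1[x] < pos1[y] and pos2[y] < pos2[x]]
    survivors = []
    for q in range(2 ** len(machines)):
        # Bit k of q set: job 1 before job 2 on machine k (unbarred letter).
        job1_first = {m: bool((q >> k) & 1) for k, m in enumerate(machines)}
        # Rule: barred x together with unbarred y is not feasible.
        if not any(not job1_first[x] and job1_first[y] for x, y in reversed_pairs):
            survivors.append(q + 1)
    return survivors


print(feasible_programs("ABCD", "DBAC"))  # [1, 2, 4, 5, 6, 8, 16]
```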
Next, we operate with the reduced set of programs and delete those 
that cannot be optimal. The rules for performing this operation are 
stated here without proof (Table 1); the proofs may be found in the 
original reference by Akers and Friedman.* (Dots indicate the pos- 


TABLE 1 
Delete 
Technological orderings programs 
Rule Job 1 Job 2 containing 
ae re a a Te 
I Xe: ¥ aces zy 
Il Xi seoeVaes Soup. @ anu ay 
Tl oe Xe ¥ EG. @ 4i2r ay 
TVin, sis Roane Oe ety? Cae 2] 
Vv XY Zee wee Kee YOr woe 
VI Xoowt YZeres eee XY ve Zove yz 


sibility of other machines occupying the corresponding position in the 


technological ordering.) 

In our example, where the technological orderings are ABCD and 
DBAC, we see that rule I ($A \cdots D$ vs. $D \cdots$) eliminates programs 
containing $ad$; thus program 16 is eliminated from the reduced set. 
Using rule II ($A \cdots C \cdots$ vs. $\cdots AC \cdots$), we eliminate program 5, 
which contains $\bar{a}c$. Rules III, IV, V, VI do not apply to the given 
technological orderings. We are left with 

Program No. 

$$
\begin{array}{ccccc}
1 & 2 & 4 & 6 & 8 \\
\hline
\bar{a} & a & a & a & a \\
\bar{b} & \bar{b} & b & \bar{b} & b \\
\bar{c} & \bar{c} & \bar{c} & c & c \\
\bar{d} & \bar{d} & \bar{d} & \bar{d} & \bar{d}
\end{array}
$$

We are thus left with five programs which are technologically feasible 
and which do not violate the rules for optimality. The last step is to 
calculate the elapsed times for each of the remaining programs, and to 
choose that program for which the elapsed time is the smallest. The 
simplest method is probably the Gantt chart, as shown in Figure 9.1. 

*S. B. Akers, Jr., and J. Friedman, “A Non-Numerical Approach to Production 
Scheduling Problems,” Operations Research, Vol. 3, no. 4 (Nov. 1955). 


Figure 9.1. Gantt chart for program 4 ($ab\bar{c}\bar{d}$), for which T = 22. 
Technological orderings: ABCD for job 1, DBAC for job 2. 


The reader should verify that program 8, requiring 16 hours, is the 
optimal program. 

There is also a graphical approach to the two-job m-machine problem, 
which is rather simple to apply and usually leads to good (though not 
necessarily optimal) results. The first step in the procedure is to draw 
a set of axes, where the horizontal axis represents processing time on 
job 1, and the vertical axis processing time on job 2. (See Figure 9.2.) 
The machine times for the two jobs are laid out on the corresponding 
axes, in the given technological order. The coordinates of any point 
on the grid represent a possible state of completion of the two jobs; 
thus point Q in Figure 9.2 denotes an instant when job I is 1/3 com- 
pleted and job II is 1/4 completed. A program involves starting with 
point Q at the origin and moving Q through various states of completion 
(i.e. points) until Q reaches the point marked “Finish.” Physically, 
the path taken by Q is limited to a series of segments which are hori- 
zontal, or vertical, or diagonal with slope 1. Movement to the right 
means that job I is proceeding while job II is idle, movement upward 
means that job II is proceeding while job I is idle, and movement 
diagonally means that both jobs are proceeding simultaneously. 

A further proscription on the movement of Q is that both jobs cannot 
be processed simultaneously on the same machine. This prohibition is 
translated to graphical form by taking the “Cartesian product” of 
A-horizontal and A-vertical, and agreeing that diagonal movement 
through this blocked-out area is forbidden; and similarly for the other 
machines. In our problem, the result is that diagonal movement 
through the shaded areas of the figure is prohibited. 

An optimal path (program) is one that minimizes idle time for job I 
(horizontal movement). Equivalently, an optimal path is one that 
minimizes idle time for job II (vertical movement). Evidently we want 
to choose a path with as much travel on the diagonal as possible. 

A good path, according to this criterion, is chosen by eye, and drawn 
on the chart (see line with arrows). The elapsed time is obtained by 
adding the idle time for either job to the processing time for that job. 


Figure 9.2. Graphical solution for the two-job m-machine sequencing problem. 


Idle time for the chosen path is seen to be 4 for job 1, and zero for job 2. 
Elapsed time = 12 + 4 = 16 = 16 + 0. Note that the graphical approach 
happens to yield the same answer as the rigorous reduced-enumerative 
approach used earlier. 
So far we have confined our discussions to situations where a given 
set of jobs had to be performed and we wished to determine a sequence 
that would best meet certain criteria. There is another type of situation 
which occurs frequently in practice, and about which even less is known. 

In a job shop, orders arrive daily, each of which requires given 
amounts of time on various machines, and each of which may have a 
promised completion date. The shop foreman has to decide, each time 
he has an idle machine, which job shall be put on the machine next. He 
has a number of possible objectives to meet in making his decision: 


(a) Minimize the average amount of time by which promised dates 
are missed. 

(b) Minimize the maximum amount by which promised dates are 
missed. 

(c) Minimize setup costs. 

(d) Maximize production. 

(e) Meet certain priorities assigned by management. 

(f) Minimize production costs. 

(g) Spread the work load uniformly over the available machines. 


In addition, there are all the problems associated with inventory 
policies. 

So far the only approach to problems involving criteria of this sort 
seems to be to simulate various possible decision rules on a computer 
and to measure the extent to which each rule meets the criteria. It is 
hardly possible to give a discussion of such simulation in a book of this 
type. Examples of rules whose efficacy can be tested by simulation 
are: Start production of the job that has the earliest promise date. Or 
start production of that job which, if started now, would be finished 
latest before its promise date. 
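
To give the flavor of such a test, here is a deliberately small sketch, not a realistic job-shop simulation. It assumes a single machine, a fixed list of waiting jobs with no new arrivals, and entirely hypothetical processing times and promise dates. The second rule is implemented under one reading of the text, namely "choose the job with the least slack before its promise date"; the names `simulate`, `earliest_promise`, and `least_slack` are ours.

```python
def simulate(jobs, rule):
    """jobs: list of (name, processing_time, promise_date).

    Returns (order, average lateness, maximum lateness) under the given rule.
    """
    waiting = list(jobs)
    clock = 0
    order, lateness = [], []
    while waiting:
        # Each time the machine falls idle, the rule picks the next job.
        job = min(waiting, key=lambda j: rule(j, clock))
        waiting.remove(job)
        clock += job[1]
        order.append(job[0])
        lateness.append(max(0, clock - job[2]))   # time by which the promise is missed
    return order, sum(lateness) / len(lateness), max(lateness)


def earliest_promise(job, clock):
    return job[2]                       # earliest promise date first


def least_slack(job, clock):
    return job[2] - (clock + job[1])    # margin left if the job were started now


hypothetical_jobs = [("J1", 2, 4), ("J2", 6, 7), ("J3", 3, 12)]
for rule in (earliest_promise, least_slack):
    print(rule.__name__, simulate(hypothetical_jobs, rule))
```

On these made-up data the two rules choose different orders and miss promise dates by different amounts, which is the kind of comparison a full simulation would make over many jobs, machines, and arrival patterns.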


THE TRAVELING SALESMAN PROBLEM 


The problem faced by the traveling salesman is to select a route that 
will minimize the total distance traveled in visiting n cities and return- 
ing to his starting point. Here we prefer to state this problem in an 
equivalent form. Suppose that n products are to be made in some order 
on a continuing basis, and the setup cost for each depends on the 
preceding product made. Given the setup cost when product $A_i$ is followed 
by $A_j$, denoted by $c_{ij}$, we wish to determine the sequence of products 
that will minimize the total setup cost. We arrange the individual 
setup costs in a square matrix, with the leading diagonal blank. 

Each non-diagonal element shows the cost of changing from the product 
listed to the left of its row to the product listed at the top of its column. 
Since changing from a product to itself involves no setup costs, the 
leading diagonal is blank. 

Our problem is to choose a set of n elements, one in each row and 
one in each column, so as to minimize the sum of the elements chosen. 
Thus far the problem is identical with the assignment problem of 
Chapter Eight; however there are two further restrictions on our 
choice of elements, one of which makes the problem considerably more 
difficult. Clearly we cannot choose an element along the diagonal, and 
this can be avoided by agreeing to fill the leading diagonal with infi- 
nitely large elements. The second restriction is that, having produced 
product $A_i$, say, we do not wish to produce it again until we have made 
all the other products. For example, with five products $A_1, \ldots, A_5$, a 
solution comprising the elements $c_{12}, c_{21}, c_{34}, c_{45}, c_{53}$ would not be 
acceptable, because it corresponds to a situation where we start with 
$A_1$, make $A_2$, and then make $A_1$ again, before making $A_3$, $A_4$, $A_5$. All 
known methods for obtaining a solution to the traveling salesman 
problem involve resorting to enumeration at some stage. At the 
present time the best available procedure is to solve the problem as if it 
were an assignment problem, and, if the solution obtained does not 
satisfy all the requirements, to modify it “by hand” until a solution is 
obtained that does. 

Before illustrating the problem with an example, we give a formal 
statement of the problem. 

Given a square matrix $[c_{ij}]$ of size n and with $c_{ii} = \infty$ for each i, 
find a subset of n elements $c_{pq}, c_{qr}, c_{rs}, \ldots, c_{uv}, c_{vp}$, where $p, q, \ldots, 
u, v$ are some permutation of the integers $1, 2, \ldots, n$, such that the 
sum of the elements of the subset shall be a minimum. 

It will be realized that there are n! permutations of the numbers 
1, ..., n; but, from any given subset we could form n equivalent 
subsets, merely by starting partway through the given subset and listing 
the elements omitted in order at the tail end. There are thus (n−1)! 
subsets among which we must search for the solution. 
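
Both points, the unacceptable assignment and the (n−1)! search, can be illustrated with a short sketch. The function names are ours, and the 4-by-4 cost matrix is purely hypothetical; it is not the matrix of any example in the text. The first function splits an assignment into its closed cycles, so that an assignment with more than one cycle is seen not to be a tour. The second enumerates the (n−1)! cycles that begin with the first product.

```python
from itertools import permutations
import math


def cycles(successor):
    """Split an assignment {i: j} ("make j immediately after i") into closed cycles."""
    seen, result = set(), []
    for start in successor:
        if start in seen:
            continue
        cycle, node = [], start
        while node not in seen:
            seen.add(node)
            cycle.append(node)
            node = successor[node]
        result.append(cycle)
    return result


def best_cycle(cost):
    """Cheapest cycle through all products, by enumerating (n-1)! orders."""
    n = len(cost)
    best_total, best_tour = math.inf, None
    for rest in permutations(range(1, n)):      # product 0 is fixed as the start
        tour = (0,) + rest
        total = sum(cost[tour[k]][tour[(k + 1) % n]] for k in range(n))
        if total < best_total:
            best_total, best_tour = total, tour
    return best_total, best_tour


# The assignment c12, c21, c34, c45, c53 from the text breaks into two cycles.
print(cycles({1: 2, 2: 1, 3: 4, 4: 5, 5: 3}))   # [[1, 2], [3, 4, 5]]

INF = math.inf                                   # leading diagonal: "infinitely large"
hypothetical_cost = [[INF, 2, 9, 10],
                     [1, INF, 6, 4],
                     [15, 7, INF, 8],
                     [6, 3, 12, INF]]
print(best_cycle(hypothetical_cost))             # (21, (0, 2, 3, 1))
```

Enumeration of this kind is practical only for small n, which is why the text falls back on the assignment solution as a starting point.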


Example 4 
Given the matrix of setup costs below, show how to sequence produc- 
tion so as to minimize setup cost per cycle. 


To 
$A_1$   $A_2$   $A_3$   $A_4$   $A_5$ 


SOLUTION 
We commence by treating the problem as an assignment problem: 
subtract the minimum element from each row and column. 


To 
$A_1$   $A_2$   $A_3$   $A_4$   $A_5$ 


The zeros of this particular matrix provide a solution to the assignment 
problem, shown by distinguished elements. However, this is not a 
solution of the traveling salesman problem, because it tells us to go 
from A, to As and then back to A. 

We examine the matrix for some of the “next best” solutions to the 
assignment problem, and try to find one that satisfies the additional 
restrictions. The smallest non-zero element is 1; so we try the effect 
of putting such an element in the solution, 
We start with the element (1, 2) and delete row 1 and column 2. In the 
remaining 4-by-4 matrix, there is no solution to the assignment problem 
among the zeros, but it is easy to see the best solution lies in the 
distinguished elements. Moreover, this is also a feasible solution of the 
traveling salesman problem; the “cost” in the reduced matrix is 2. We 
now know that any solution where the “cost” exceeds 2 is not optimum; 
we need only examine solutions containing the element 1 (row 4, 
column 3) not so far used to see if a better solution exists. On deleting 
row 4 and column 3 we see that the remaining 4-by-4 matrix does not 
have a solution among the zeros, and we conclude that the minimum 
“cost” for the reduced matrix is 2. 
Thus the best cycle is 


As > A, ~ Ap > As > As > As 
and the setup costs will total 1 + 2 + 3 + 4 + 5 = 15. 


PROBLEMS FOR SOLUTION 


Set XII 


1. A bookbinder has one printing press, one binding machine, and the 
manuscripts of a number of different books. The times required to perform 
the printing and binding operations for each book are known. We wish to 
determine the order in which books should be processed, in order to minimize 
the total time required to turn out all the books. 


        Printing   Binding 
Book      time       time 
1          30         80 
2         120        100 
3          50         90 
4          20         60 
5          90         30 
6         110         10 
2. An additional operation is added to the process: finishing. What should 
the order of the books be? 


        Printing   Binding   Finishing 
Book      time       time       time 
1 30 80 20 
2 120 100 40 
3 50 90 60 
4 20 60 120 
5 90 30 70 
6 110 10 30 


[Hint: Do not seek a rigorous optimum, as the methods of the text are not 
applicable to this problem. Do try to find an economical solution.] 


3. A machine shop has six machines: A, B, C, D, E, and F. Two jobs must 
be processed through each of these machines. The times spent on each machine 
and the necessary sequence of the jobs through the shop are given below. 


Order 1 2 3 4 5 6 


Job I    A—20 C—10 D—10 B—30 E—2 F—15 
Job II   A—10 C—30 B—15 D—10 F—15 E—20 


In what order should the jobs be done on each of the machines, in order to 
minimize the total time necessary to finish the jobs? 
(Do in two ways.) 


4. A machine operator processes five types of items on his machine each 
week, and must choose a sequence for them. The main costs involved are setup 
costs and priority. The setup cost per change depends on the item presently 
on the machine and the setup to be made, according to the following table: 


To item 
ABCDE 


From item C 
D 
E 


In addition, there is a priority rating among the items, A and B having a higher 
priority than C, D, E. This may be interpreted as adding an additional cost 
of 5 when any high-priority item immediately follows a low-priority item. 
If he processes each type of item once and only once each week, how should he 
sequence the items on his machine? 
Dynamic programming 


Dynamic programming is a newly developed mathematical technique 
which is useful in many types of decision problems. In most of the 
situations we have discussed in this book, where a series of consecutive 
decisions have to be made, the optimal over-all policy could be arrived 
at by considering the effects of each decision separately. In the inven- 
tory area, for example, there are some situations where a policy of 
producing each month so as to minimize inventory cost for the month 
immediately affected will minimize the inventory cost for the whole 
year. In many other problems, however, it is by no means clear that 
the over-all return from a procedure of optimizing over each individual 
period is the best that can be attained. It might be, for example, that 
a bit of a sacrifice in January’s return might place us in a much stronger 
position in relation to February, and so on. Dynamic programming is 
a method for investigating such possibilities. 

The subject owes much of its development to Richard Bellman and 
his associates at the Rand Corporation; the reader interested in a more 
complete treatment is referred to the book by Bellman cited in the 
references at the end of this chapter. 
PROBLEMS WITH A FINITE NUMBER 
OF CONSECUTIVE DECISIONS 


The method of dynamic programming is based on the mathematical 
notion of recursion. We will start by discussing a purely mathematical 
example to illustrate the method, and then give a more complicated 
decision-type problem. 

Example 1 

A positive quantity c is to be divided into n parts in such a way that 
the product of the n parts is to be a maximum. Use recursion to obtain 
the optimal subdivision. 

SOLUTION 

Let $f_n(c)$ denote the maximum attainable product regarded as a 
function of c (the given positive quantity) and n (the number of parts 
into which the quantity is to be divided). 

If we regard c as fixed, and let n vary over the positive integers, $f_n(c)$ 
becomes a function of the discrete variable n. We may define $f_1(c) = c$, 
and we then have, for n = 2, 

$f_2(c)$ = the maximum value of $y f_1(c - y)$ 
or $f_2(c)$ = the maximum value of $y(c - y)$ 

subject to the restriction 

$$0 < y \le c$$

We may abbreviate the notation by writing this as 

$$f_2(c) = \max_{0 < y \le c} \{ y(c - y) \} \tag{10.1}$$

By simple calculus, we find that the function $y(c - y)$ attains its 
maximum for y = c/2, and this value of y certainly satisfies the restriction. 

Thus, for n = 2 we have 

Optimal policy (c/2, c/2) 
$$f_2(c) = c^2/4 \tag{10.2}$$

Now suppose that we have solved the problem [that is, we know 
$f_n(c)$] for n = 2, 3, ..., m, and inquire into the solution for n = m+1. 
If we take the first of the m+1 parts as y, we have an amount c−y to 
be divided into m further parts. To proceed further, we make use of 
Bellman’s principle of optimality, which states: 


An optimal policy has the property that, whatever the initial state and 
initial decision are, the remaining decisions must constitute an optimal 
policy with respect to the state resulting from the first decision. 


If the initial state is c, and the first decision is y, then the state just 
preceding the second decision is c−y. By definition of $f_m$, the maximum 
achievable product for c−y divided into m parts is $f_m(c - y)$. Then the 
conditional maximum product for c divided into m+1 parts, given the 
initial choice of y, is 

$$y f_m(c - y)$$

By the principle of optimality, we therefore have 

$$f_{m+1}(c) = \max_{0 < y \le c} \{ y f_m(c - y) \} \tag{10.3}$$

Since the function $f_m(x)$ is known by our assumption, the quantity in 
braces in (10.3) is a known function of the single variable y. The 
maximizing $\hat{y}$ can be obtained, and we will have 

$$f_{m+1}(c) = \hat{y} f_m(c - \hat{y})$$
Optimal policy: ($\hat{y}$ + m-part policy for $c - \hat{y}$) 
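
The recursion (10.3) can be run numerically. The sketch below is an illustration under an assumption that the text does not make: c and the parts are restricted to whole numbers, so that the maximization over y becomes a finite search. The function name `best_product` is ours. Each stage solves a one-variable problem and reuses the stored results for the smaller number of parts.

```python
from functools import lru_cache


def best_product(c, n):
    """Maximum product, and one optimal policy, for integer c split into n integer parts."""

    @lru_cache(maxsize=None)
    def f(m, amount):
        if m == 1:
            return amount, (amount,)          # f_1(c) = c
        if amount == 0:
            return 0, (0,) * m                # nothing left to divide
        best_value, best_policy = -1, ()
        for y in range(1, amount + 1):        # restriction 0 < y <= amount
            value, rest = f(m - 1, amount - y)
            if y * value > best_value:        # y * f_{m-1}(amount - y), as in (10.3)
                best_value, best_policy = y * value, (y,) + rest
        return best_value, best_policy

    return f(n, c)


print(best_product(12, 3))  # (64, (4, 4, 4))
print(best_product(10, 2))  # (25, (5, 5))
```

When c is a multiple of n the integer search reproduces the equal subdivision found by calculus; otherwise it returns the best whole-number policy, which need not consist of equal parts.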


The explicit solution for n = 2 has already been given in (10.2). 
For n = 3, (10.3) specializes to 

$$f_3(c) = \max_{0 < y \le c} \left\{ y \, \frac{(c - y)^2}{4} \right\} \tag{10.4}$$

yielding a $\hat{y}$ of c/3. Since $f_2(2c/3) = 4c^2/36 = c^2/9$, we have, for n = 3: 

Optimal policy (c/3, c/3, c/3) 
$$f_3(c) = c^3/27 \tag{10.5}$$

With (10.2) and (10.5) as a guide, we may conjecture that the solution 
for general n is: 

Optimal policy (c/n, c/n, ..., c/n) 
$$f_n(c) = (c/n)^n \tag{10.6}$$

This result is readily established by induction. Assuming that (10.6) 
holds for n, we have, from (10.3), 

$$f_{n+1}(c) = \max_{0 < y \le c} \left\{ y \left( \frac{c - y}{n} \right)^{n} \right\}$$

The maximum is attained at $\hat{y} = c/(n+1)$. Thus 

Optimal policy [c/(n + 1), c/(n + 1), ..., c/(n + 1)] 
$$f_{n+1}(c) = [c/(n + 1)]^{n+1}$$
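
As a check on the induction step, the maximization can be carried out explicitly. The lines below are our own working in the notation of (10.3) and (10.6), and assume $0 < y < c$ so that the derivative can be set equal to zero:

$$
\frac{d}{dy}\left[ y \left( \frac{c - y}{n} \right)^{n} \right]
= \left( \frac{c - y}{n} \right)^{n-1} \frac{c - (n+1)\,y}{n} = 0
\quad \Longrightarrow \quad \hat{y} = \frac{c}{n+1}
$$

$$
f_{n+1}(c) = \frac{c}{n+1} \left( \frac{c - c/(n+1)}{n} \right)^{n}
= \frac{c}{n+1} \left( \frac{c}{n+1} \right)^{n}
= \left( \frac{c}{n+1} \right)^{n+1}
$$

The remaining amount $c - \hat{y} = nc/(n+1)$ is then divided into n equal parts of size c/(n+1), in agreement with the stated policy.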


The problem has been solved in this manner in order to introduce 
the notion of recursion. A more direct approach would be to use the 
method of Lagrangian multipliers, as follows. Let c be divided into n 
parts $x_1, x_2, \ldots, x_n$, so that we have to maximize the function 

$$y = x_1 x_2 \cdots x_n$$

subject to the constraints 

$$\sum_{i=1}^{n} x_i = c; \quad x_1 \ge 0; \; x_2 \ge 0; \; \ldots; \; x_n \ge 0$$

Let 

$$Y = x_1 x_2 \cdots x_n + \lambda \left( c - \sum_{i=1}^{n} x_i \right)$$

The maximum value of y, for admissible values of $x_i$, either occurs at the 
subset $(x_1, \ldots, x_n)$ of the solution $(x_1, \ldots, x_n, \lambda)$ of the system 

$$\frac{\partial Y}{\partial x_i} = 0, \quad i = 1, 2, \ldots, n; \qquad \frac{\partial Y}{\partial \lambda} = 0$$

or conceivably it could occur on one of the bounding planes $x_i = 0$. 

The solution of the system of equations is easily found to be 

$$x_i = \frac{c}{n} \quad \text{for each } i$$

yielding 

$$y = \left( \frac{c}{n} \right)^{n}$$

These values for $x_i$ satisfy all the constraints and yield a value for y 
larger than the value when any $x_i$ is zero; so together they constitute 
the optimal subdivision of c. 
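
For reference, the system of equations can be written out and solved as follows. This is our own working, using the symbols Y, $\lambda$, and $x_i$ of the text:

$$
\frac{\partial Y}{\partial x_i} = \prod_{j \ne i} x_j - \lambda = 0 \quad (i = 1, 2, \ldots, n),
\qquad
\frac{\partial Y}{\partial \lambda} = c - \sum_{i=1}^{n} x_i = 0
$$

Multiplying the ith equation by $x_i$ gives $x_1 x_2 \cdots x_n = \lambda x_i$ for every i. When all the $x_i$ are positive, the left side is the same positive number for each i, so the $x_i$ are all equal, and the constraint $\sum x_i = c$ then gives $x_i = c/n$.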


An essential difference between the two methods is that the recursive 
(dynamic programming) approach changes one problem in n variables 
into n problems, each in one variable. In more complicated examples, 
the simultaneous equations resulting from the classical calculus ap- 
proach may be extremely difficult to solve; and, if more than one 
solution exists, we must ascertain which solution yields the absolute 
maximum. Worse still, the calculus approach will not necessarily 
reveal the maximum, subject to constraints, if it lies on the boundary 
of the admissible region or if we are dealing with non-differentiable 
functions. (In the previous example, calculus will not reveal a minimiz- 
ing subdivision of c, which is any subdivision for which at least one of 
the x; is taken as zero.) We thus would have to examine the values of 
the function on the boundaries in order to be sure that the “maximum” 
(if any) revealed by calculus is not exceeded on them. We may even 
have to use a “search technique” to examine the function numerically 
over the entire admissible region. With a large number of variables 
and complex boundaries, this can become a far more difficult task than 
n iterations of a one-variable problem with comparatively simple 
boundaries. 

The advantage of the dynamic programming approach to such 
problems is heightened when the function we are attempting to maxi- 
mize does not possess partial derivatives at all points in the region of 
interest. The ordinary approach is to compute functional values at 
sequentially chosen points in n-space, eventually locating the maximum 
either exactly or, more likely, within some small interval. Roughly 
speaking, the amount of computation increases exponentially with the 
number of variables. When it is possible to reformulate the n-variable 
problem as a series of n problems each in one variable, the computa- 
tional labor is much reduced. 
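
The reduction of one n-variable problem to n one-variable problems can be
sketched numerically for the subdivision example. The sketch below assumes
the recursion has the form f_1(c) = c, f_{k+1}(c) = Max over z of
z f_k(c - z), and it searches each one-variable problem on a grid; the
function name and the grid size are illustrative choices, not part of the
text.

```python
def max_product_split(c, n, grid=240):
    """Approximate f_n(c), the largest product of n non-negative parts of c.

    Assumed recursion: f_1(c) = c, f_{k+1}(c) = max_z z * f_k(c - z),
    with z restricted to multiples of c / grid.
    """
    step = c / grid
    # f[j] holds f_k(j * step); start from f_1.
    f = [j * step for j in range(grid + 1)]
    for _ in range(n - 1):
        # One-variable search over the size z = i * step of the next part.
        f = [max(i * step * f[j - i] for i in range(j + 1))
             for j in range(grid + 1)]
    return f[grid]


c, n = 12.0, 4
# Compare the recursive value with the closed form (c/n)^n.
print(max_product_split(c, n), (c / n) ** n)
```

When the grid size is a multiple of n, the equal subdivision lies on the
grid, so the two printed values should agree up to rounding.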


Example 2 

A man is engaged in buying and selling identical items, each of
which requires considerable storage space. He operates from a warehouse
which has a capacity of 500 items. He can order on the 15th
of each month, at the prices shown below, for delivery on the first of
the following month. During a month he can sell any amount up to
his total stock on hand, at the market prices given below. If he starts
the year with 200 items in stock, how much should he plan to purchase
and sell each month in order to maximize his profits (cash receipts
minus cash expenditures) for the year? (Decision problems of
this type are often referred to as "warehouse problems.")


Order date       Cost price [c_i]     Month        Sales price [p_i]
January 15            150             January           165
February 15           155             February          165
March 15              165             March             185
April 15              160             April             175
May 15                160             May               170
June 15               160             June              155
July 15               155             July              155
August 15             150             August            155
September 15          155             September         160
October 15            155             October           170
November 15           150             November          175
December 15           150             December          170


SOLUTION

Let us call January month 1, February month 2, etc., and introduce
the following notation:

$x_i$ = amount to be sold during month $i$

$y_i$ = amount to be ordered on the 15th of month $i$

$p_i$ = sales price during month $i$

$c_i$ = purchase price on the 15th of month $i$

$s_i$ = stock level on the 1st of month $i$

$H$ = warehouse capacity

Each month we must decide on the values of $x_i$ and $y_i$. As a basis for
these two decisions, we have future cost prices and sales prices, the
existing inventory level $s_i$, and the restriction that no future $s_i$ can
exceed the warehouse capacity $H$. (An additional restriction is, of
course, that no $x_i$, $y_i$, or $s_i$ can be negative.)

Following the notation of the previous problem, we define

$f_n(s)$ = maximum achievable return during the remaining $n$ months,
if the existing stock level is $s$

Thus, $f_{12}$ refers to January, $f_{11}$ to February, etc.
On the first day of December, the final month, we have

$$f_1(s_{12}) = \max_{x_{12},\, y_{12}} \{ p_{12} x_{12} - c_{12} y_{12} \}$$

where $x_{12}$ and $y_{12}$ are subject to the restrictions

$$0 \le x_{12} \le s_{12}, \qquad 0 \le y_{12} \le H - [s_{12} - x_{12}]$$

The solution for December is clearly

$$f_1(s_{12}) = p_{12} s_{12} \tag{10.7}$$

Policy: Take $x_{12} = s_{12}$; $y_{12} = 0$

On November 1, there are two months left and the existing stock
level is $s_{11}$. Thus

$$f_2(s_{11}) = \max_{x_{11},\, y_{11}} \{ p_{11} x_{11} - c_{11} y_{11} + f_1(s_{11} + y_{11} - x_{11}) \} \tag{10.8}$$

subject to

$$0 \le x_{11} \le s_{11}, \qquad 0 \le y_{11} \le H - [s_{11} - x_{11}] \tag{10.9}$$

We know from solution (10.7) that $f_1(s_{11} + y_{11} - x_{11})$ has the value
$p_{12}(s_{11} + y_{11} - x_{11})$.

Thus we have explicitly

$$f_2(s_{11}) = \max_{x_{11},\, y_{11}} \{ [p_{11} - p_{12}] x_{11} + [p_{12} - c_{11}] y_{11} + p_{12} s_{11} \} \tag{10.10}$$

subject to the same restrictions on $x_{11}$ and $y_{11}$.
Since the expression in braces in (10.10) is linear in $x_{11}$ and $y_{11}$,
and the restrictions on $x_{11}$ and $y_{11}$ are also linear, we know from the
simplex discussion in Chapter Eight that the solution will be at one of
the corners of the area satisfying all of the restrictions (10.9). The
restrictions are: $x_{11} \ge 0$, $y_{11} \ge 0$, $x_{11} \le s_{11}$, $-x_{11} + y_{11} \le H - s_{11}$.
Figure 10.1 shows the area including all feasible decisions. The four
corners representing possible optima are, for the situation in (10.10),
$(0, 0)$, $(0, H - s_{11})$, $(s_{11}, H)$, $(s_{11}, 0)$. The actual optimum is obtained
by enumeration, yielding a solution for $x_{11}$ and $y_{11}$ in terms of $s_{11}$.
Further, since $s_{12} = s_{11} + y_{11} - x_{11}$, the policy (10.7) for December
can also be stated in terms of $s_{11}$. Another iteration would give the
policies for October, November, and December in terms of $s_{10}$, and so
on, until finally we would have the policies for all the months stated in
terms of the known initial stock level $s_1$.

[Figure 10.1: feasible region in the $(x, y)$ plane, with corners $(0, 0)$, $(s, 0)$, $(s, H)$, and $(0, H - s)$.]

Figure 10.1. Restrictions on sales and purchases during a month starting with
stock level s.

A complete algebraic solution for general values of $c_i$, $p_i$, $H$, $s_1$ is
extremely cumbersome, and will not be given here. Instead, we work out
the numerical solution to the problem as stated.


The general relationship is

$$f_{n+1}(s) = \max_{\substack{0 \le x \le s \\ 0 \le y \le H - s + x}} \{ p_{12-n}\, x - c_{12-n}\, y + f_n(s + y - x) \}$$

Notice that, as $f_1(s)$ is linear in $s$, each successive $f_n(s)$ is also linear in $s$.
Hence the expression to be maximized is linear in $x$ and $y$.
Repeated application of this equation yields the following:
December 1: Maximize $170x_{12} - 150y_{12}$
Policy: $x_{12} = s_{12}$, $y_{12} = 0$
$f_1(s_{12}) = 170s_{12}$

November 1: Maximize $5x_{11} + 20y_{11} + 170s_{11}$
Policy: $x_{11} = s_{11}$, $y_{11} = H$
$f_2(s_{11}) = 175s_{11} + 20H$

October 1: Maximize $-5x_{10} + 20y_{10} + 175s_{10} + 20H$
Policy: $x_{10} = s_{10}$, $y_{10} = H$
$f_3(s_{10}) = 170s_{10} + 40H$

September 1: Maximize $-10x_9 + 15y_9 + 170s_9 + 40H$
Policy: $x_9 = s_9$, $y_9 = H$
$f_4(s_9) = 160s_9 + 55H$

August 1: Maximize $-5x_8 + 10y_8 + 160s_8 + 55H$
Policy: $x_8 = s_8$, $y_8 = H$
$f_5(s_8) = 155s_8 + 65H$

July 1: Maximize $155s_7 + 65H$
Policy: Any feasible $(x_7, y_7)$
$f_6(s_7) = 155s_7 + 65H$

June 1: Maximize $-5y_6 + 155s_6 + 65H$
Policy: $y_6 = 0$, any feasible $x_6$
$f_7(s_6) = 155s_6 + 65H$

May 1: Maximize $15x_5 - 5y_5 + 155s_5 + 65H$
Policy: $x_5 = s_5$, $y_5 = 0$
$f_8(s_5) = 170s_5 + 65H$

April 1: Maximize $5x_4 + 10y_4 + 170s_4 + 65H$
Policy: $x_4 = s_4$, $y_4 = H$
$f_9(s_4) = 175s_4 + 75H$

March 1: Maximize $10x_3 + 10y_3 + 175s_3 + 75H$
Policy: $x_3 = s_3$, $y_3 = H$
$f_{10}(s_3) = 185s_3 + 85H$

February 1: Maximize $-20x_2 + 30y_2 + 185s_2 + 85H$
Policy: $x_2 = s_2$, $y_2 = H$
$f_{11}(s_2) = 165s_2 + 115H$

January 1: Maximize $15y_1 + 165s_1 + 115H$
Policy: $x_1 = s_1$, $y_1 = H$
$f_{12}(s_1) = 165s_1 + 130H$


Here $s_1 = 200$, $H = 500$. Thus the maximum achievable return
$f_{12}(200)$ is (165)(200) + (130)(500) = 98,000. This is achieved by
means of the optimal policy summarized in Table 1.


TABLE 1

Month        Initial stock (s)    Sell (x)    Buy (y)
January            200              200        500
February           500              500        500
March              500              500        500
April              500              500        500
May                500              500          0
June                 0                0          0
July                 0                0        y (≤ 500)
August               y                y        500
September          500              500        500
October            500              500        500
November           500              500        500
December           500              500          0
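
The backward recursion for this example can be checked numerically. The
sketch below is one possible arrangement, not the book's procedure: it
assumes that stock levels and decisions may be restricted to multiples of
100 items, which loses nothing here because the optimum lies at a corner
of the feasible region and s = 200, H = 500 are themselves multiples of
100. The names `best_return`, `COST`, `PRICE`, and `STEP` are
illustrative.

```python
from functools import lru_cache

# Price data from the example, January .. December.
COST = (150, 155, 165, 160, 160, 160, 155, 150, 155, 155, 150, 150)
PRICE = (165, 165, 185, 175, 170, 155, 155, 155, 160, 170, 175, 170)
H = 500      # warehouse capacity
STEP = 100   # assumed grid for stock levels and decisions


@lru_cache(maxsize=None)
def best_return(month, stock):
    """Maximum return from `month` (0 = January) through December."""
    if month == 12:
        return 0
    best = float("-inf")
    for x in range(0, stock + 1, STEP):              # amount sold
        for y in range(0, H - stock + x + 1, STEP):  # amount ordered
            value = (PRICE[month] * x - COST[month] * y
                     + best_return(month + 1, stock + y - x))
            best = max(best, value)
    return best


# Return for the year starting with 200 items in stock.
print(best_return(0, 200))
```

The printed value can be compared with the figure 98,000 obtained above.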


Before leaving this problem we mention that Stuart Dreyfus * has
discussed it in greater detail than we have here. He has been able to
show that:

1. The optimal n-period return is a linear function of the initial
stock with coefficients dependent on the cost and selling prices.

2. The optimal policy at any stage is independent of the initial stock
at that stage.

3. There is an optimal policy with the following structure: Do nothing
for k periods (k may be zero), and then oscillate between a full and
an empty warehouse.

4. The policy and return can be calculated from recurrence relationships
between the coefficients of the linear return function.

* See "An Analytic Solution of the Warehouse Problem," by Stuart E. Dreyfus,
Management Science, Vol. 4, no. 1 (Oct. 1957).


PROBLEMS FOR SOLUTION 
Set XIV 


1. A sector of a circle $P_0 O P_n$, of unit radius, subtends an angle $\alpha$ at the center.
A set of $n-1$ points $P_1, P_2, \cdots, P_{n-1}$ is chosen along the arc. Show that the
length of the broken segment $P_0 P_1 P_2 \cdots P_n$ is

$$\sum_{i=1}^{n} 2 \sin \frac{\theta_i}{2}$$

where $\theta_i$ is the angle between $OP_{i-1}$ and $OP_i$.
If $f_n(\alpha)$ is the maximum length of $P_0 P_1 P_2 \cdots P_n$, show that

$$f_n(\alpha) = \max_{0 \le \theta \le \alpha} \left\{ 2 \sin \frac{\theta}{2} + f_{n-1}(\alpha - \theta) \right\}$$

Hence, show inductively that

$$f_n(\alpha) = 2n \sin \frac{\alpha}{2n}$$

and deduce that the polygon of n sides with the largest perimeter, which can
be inscribed in a given circle, is the equiangular polygon. (See Figure 10.2.)

Figure 10.2. Diagram for Problem 1.

2. The manufacturing process for a perishable commodity is such that the
cost of changing the level of production from one month to the next is twice
the square of the difference in production levels. Any production not sold by
the end of the month is wasted at a cost of $20 per unit. Given the sales
forecast below, which must be met, determine a production schedule to minimize
costs. Assume that December production was 200 units.

Month             January    February    March    April
Sales forecast      210        220        195      180

[Hint: Let $f_n(p)$ be the minimum achievable cost when last month's production
was $p$ and there are $n$ months to go. Show that

$$f_n(p) = \min_{x \ge s} \{ 2(x - p)^2 + 20(x - s) + f_{n-1}(x) \}$$

where $s$ = sales forecast for the current month
$x$ = production for the current month

Use this formula for successive months starting with $f_1(p)$ for April and
working backward to $f_4(p)$ for January.]

[Ans.   January    February    March    April
          210        220        210      205]
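
As a check on the hint for Problem 2, the recursion can be carried out on
a grid of whole-unit production levels. This is a sketch under stated
assumptions: the range 150 to 300 for candidate levels is an arbitrary
choice, and `production_schedule` is a hypothetical helper name.

```python
SALES = [210, 220, 195, 180]   # January .. April forecast
START = 200                    # December production
LEVELS = range(150, 301)       # assumed grid of candidate production levels


def production_schedule(sales, start, levels=LEVELS):
    """Backward recursion f_n(p) = min_{x >= s} 2(x-p)^2 + 20(x-s) + f_{n-1}(x)."""
    cost_to_go = {p: 0 for p in levels}    # f_0(p) = 0
    decisions = []                         # one {p: best x} map per month, last month first
    for s in reversed(sales):
        new_cost, best_x = {}, {}
        for p in levels:
            new_cost[p], best_x[p] = min(
                (2 * (x - p) ** 2 + 20 * (x - s) + cost_to_go[x], x)
                for x in levels if x >= s
            )
        cost_to_go = new_cost
        decisions.append(best_x)
    # Forward pass: follow the stored decisions from the known starting level.
    schedule, p = [], start
    for best_x in reversed(decisions):
        p = best_x[p]
        schedule.append(p)
    return schedule, cost_to_go[start]


print(production_schedule(SALES, START))
```

The first element of the result is the schedule for January through
April; the second is the total cost of that schedule.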


3. (A replacement problem.) A machine which produces light bulbs has
one part that is liable to failure, in a manner that is a function of the number
of bulbs made since the part was replaced. If the part fails in service, no
bulb is produced and there is a cost $c_f$ associated with wasted time, material and
replacement. If the part is replaced before failure, there is a cost $c_r\ (< c_f)$.
The probability of failure on the next bulb, when $m$ bulbs have been made,
is $p_m$.

After each bulb is produced, we have the choice of replacing the part or
trying to make the next bulb with the existing part. The decision to replace
or not will depend on the relative costs. Let $f_n(m)$ be the least expected cost
of producing $n$ more bulbs when the part has already made $m$ bulbs.

Show that

$$f_n(m) = \min \{ f_n(0) + c_r;\ (1 - p_m) f_{n-1}(m + 1) + p_m [c_f + f_n(0)] \}$$

and that

$$f_n(0) = f_{n-1}(1) + \frac{p_0}{1 - p_0}\, c_f, \qquad f_1(0) = \frac{p_0}{1 - p_0}\, c_f$$

Given

$p_0 = 0$, $p_1 = 0.1$, $p_2 = 0.2$, $p_3 = 0.4$, $p_4 = 0.6$, $p_5 = 1.0$,
$c_f = 3$, $c_r = 1$

compute successively fi(m), fa(m), «--, fo(m) for m = 0, 1, 2,3, 4,5. What do 
your results suggest about a replacement policy when the number of bulbs 
to be produced is increased indefinitely? 

[Ans. For large $n$, $f_n(m)$ is minimized by replacing whenever $m \ge 3$.]
4. You are playing a game in which you are allowed to make as many as
10 successive draws from a normal distribution with mean zero and standard 
deviation unity. You are allowed to stop the game at the end of any draw, 
and are paid an amount equal to the result of that draw. At each step, the 
decision of whether or not to continue clearly depends on (1) the value just 
drawn, and (2) the stage of the game. Determine a strategy that will maxi- 
mize your expected gain. 


PROBLEMS WITH AN INDEFINITE NUMBER 
OF CONSECUTIVE DECISIONS 


So far we have assumed that the number of consecutive decisions to
be made is a known integer: e.g., n = 12 in the warehouse problem. In
many situations, however, the number of decision periods over which we
should attempt to optimize is not obvious. Whether the decision maker 
should consider only the following year, or the following five years, or 
the following fifty years is a fundamental problem in value theory; and 
it is not our purpose to attempt such an analysis. We can observe, 
however, that business operations are seldom designed to terminate at 
a specific point in time; usually it is an implicit intention of manage- 
ment to continue in business indefinitely. Thus management may wish 
to minimize total expected costs over the indefinite future. 

Unfortunately, so long as some substantial non-zero costs are to 
be expected for each decision period, all policies will result in an un- 
limited cost if we consider a long enough time in the future. It is mean- 
ingless to talk of choosing a policy that will minimize a sum, divergent 
for all policies. Nevertheless, management regularly chooses policies 
whose results are to be measured over the indefinite future, and often 
operations research workers must assist in such decisions. 

There are two ways out of the difficulty. The first is to discount costs
to be incurred in the future. If decisions are to be made (and costs
incurred) infrequently, say less often than once a year, this makes sense
both intuitively and economically. Intuitively, one feels that costs to
be incurred this year should have more influence on our decisions than
those to be incurred next year, and so on. Economically, the postponement
of expenditure is to be desired as the savings can presumably be
invested elsewhere.

If the discount factor is less than one, and if costs each year are of the
same order of magnitude, the sum of discounted costs will converge,
and it is meaningful to seek a policy that minimizes the sum of all
future discounted costs.

There is one disadvantage to the use of a discount factor. If decisions
are to be made frequently, say once a week, the idea of discounting over
a week (even with a discount factor very nearly equal to unity) is
difficult to reconcile with our intuitive notions of business and
economics. This dilemma is avoided by the second of the two expedients
referred to above. We have already seen (Example 3 of Chapter Five)
that, for any series of future payments, there is an equivalent series of
equal payments with the same present worth as the original series, and
that, as the discount factor tends to one, the equal payments approach
the average of the original series. Thus, if the decisions are to be made
so frequently that the discount factor is effectively one, we can endeavor
to choose a policy that minimizes the average cost per decision as the
number of decisions tends to infinity.

In most practical problems, conditions are such that the total of the
discounted costs over the indefinite future is finite, and the average cost
per decision converges as the number of decisions approaches infinity.
Moreover, the policy that minimizes long-term average cost will be the
limiting case, as the discount factor tends to unity, of the policy that
minimizes the total of the discounted costs. Proofs of convergence
are not given here; but we note that proof of convergence is usually
easier for the total of discounted costs, and that, in general, when total
discounted costs converge, so does the average cost.
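
The relation between discounted totals and averages can be illustrated
numerically. The sketch below uses a made-up cost stream and assumes the
first payment falls due immediately (undiscounted); the function name
`equivalent_equal_payment` is hypothetical. It computes the equal payment
with the same present worth as the stream and shows how that payment
behaves as the discount factor v approaches one.

```python
def equivalent_equal_payment(costs, v):
    """Equal payment per period with the same present worth as `costs`.

    Assumes the payment for period t is discounted by v**t, t = 0, 1, ...
    """
    present_worth = sum(c * v ** t for t, c in enumerate(costs))
    annuity_factor = sum(v ** t for t in range(len(costs)))
    return present_worth / annuity_factor


costs = [10, 2, 6] * 200   # hypothetical cost stream; its plain average is 6
for v in (0.5, 0.9, 0.99, 0.9999):
    print(v, equivalent_equal_payment(costs, v))
```

For small v the early costs dominate the equal payment; as v tends to one
the payment moves toward the plain average of the stream.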

It sometimes happens that a policy that minimizes long-term average 
costs leads to the same individual decisions as a policy that aims only 
at minimizing cost during the period immediately affected by the 
decision. The following example illustrates a situation where this 
happens, and also gives the procedure for finding the long-term average 
cost associated with an optimal policy. 


Example 3 

A dealer places an order with his wholesaler on the first of each
month, and obtains delivery one month later. The cost of holding
inventory is $C_1$ per unit per month, and the cost of shortage is $C_2$ per
unit per month, shortages being carried over from one month to the
next. If the monthly demand $x$ is a random variable with density
function $p(x)$, find the policy that minimizes the long-term average costs
per month, and show that it is the same as the policy that minimizes
the expected cost during the month following delivery.


SOLUTION

The situation here is very similar to that of Example 7 in Chapter
Four, which dealt with the recruitment of airline hostesses. The main
difference is that we are now working with continuous quantities
instead of discrete.

Let $s$ = stock on hand (including the delivery just received) on the
          first day of the month in question
    $z$ = size of order to be placed
    $S = s + z$

Since $s$ is known at the time the decision is made, it is immaterial
whether we regard $z$ or $S$ as the decision variable. For convenience in
presentation, we shall use $S$. A policy is some function $S$ of $s$, and an
optimal policy is a function $S(s)$ which minimizes the long-term average
monthly cost. It will be noted that, if there is no way of reducing
stocks, apart from sales, we are restricted to policies for which $S(s) \ge s$.

Now let $x$ = demand in current month
        $v$ = discount factor for one month ($v < 1$)
        $I(s)$ = expected cost for current month [note that $I(s)$ is
                 independent of our policy; it is defined for both positive
                 and negative $s$]
        $f_n(s)$ = total discounted cost (expected) over $n$ months, when
                 we use an optimal policy $S(s)$

The following month will start with a stock level $s - x + z = S - x$,
and we will have, by the principle of optimality,

$$f_n(s) = \min_{S \ge s} \left\{ I(s) + v \int_0^\infty f_{n-1}(S - x)\, p(x)\, dx \right\} \tag{10.11}$$

If we assume (as can be proved) that the sequence of functions $f_n(s)$
tends to a limiting function $f(s)$ as $n$ tends to infinity, then $f(s)$ must
satisfy the functional equation:

$$f(s) = \min_{S \ge s} \left\{ I(s) + v \int_0^\infty f(S - x)\, p(x)\, dx \right\} \tag{10.12}$$


The first term in braces, $I(s)$, is clearly the same for all $S$. The
second term is a function of $S$, which we may call $g(S)$, and may be
interpreted as the present value of the return from an optimal policy
starting a month hence (in a situation conditioned by the fact that $S - s$
is the order placed now). It is reasonable to suppose that this term
graphed as a function of $S$ should have roughly the form shown in
Figure 10.3. The value $S_0$ which minimizes the plotted function is independent
of the present stock level $s$. If $S_0$ exceeds $s$, then $S_0$ is the minimizing
$S$ in (10.12), and our decision today should be to order $S_0 - s$. If $S_0$ is
less than $s$, then the best we can do (see Figure 10.3) is to order nothing
this month.

[Figure 10.3: graph of $g(S)$ against $S$, with its minimum at $S_0$.]

Figure 10.3. Concavity of variable term in equation (10.12).

The discussion up to this point does not furnish a means for actually
calculating $S_0$, since the function $g(S)$ is defined in terms of the
undetermined function $f$. We can, however, obtain $S_0$ from the known function
$I$ as part of the following argument. Assuming that an optimizing
value $S_0$ exists; then, for $s < S_0$, we have


$$\begin{aligned}
f(s) &= I(s) + g(S_0) \\
     &= I(s) + v \int_0^\infty [I(S_0 - x) + g(S_0)]\, p(x)\, dx \\
     &= I(s) + v \int_0^\infty I(S_0 - x)\, p(x)\, dx + v\, g(S_0)
\end{aligned}$$

Equating $I(s) + g(S_0)$ to the last line and transposing, we obtain

$$g(S_0) = \frac{v}{1 - v} \int_0^\infty I(S_0 - x)\, p(x)\, dx \tag{10.13}$$

Hence, for $s < S_0$,

$$f(s) = I(s) + \frac{v}{1 - v} \int_0^\infty I(S_0 - x)\, p(x)\, dx$$

Thus $f(s)$ is the present worth of a series of monthly payments

$$(1 - v) f(s) = (1 - v) I(s) + v \int_0^\infty I(S_0 - x)\, p(x)\, dx$$

[Recall the present worth of a payment $X$ each month for an indefinite
period is $X/(1 - v)$.]
In the limit as $v$ tends to unity these payments become the long-term
average cost, which is therefore

$$\int_0^\infty I(S_0 - x)\, p(x)\, dx \tag{10.14}$$


If we were to start at a point in time where $s$ exceeded $S_0$, the same
result would apply. For, we would order nothing until stocks fell to
some point $s'$ below $S_0$, and repeat the analysis from that point. Since
the result (10.14) is independent of the initial stock level $s$, the
long-term average from that point on will coincide with (10.14). The costs
incurred up to the point where the stock level falls below $S_0$ make no
contribution to the long-term average cost; and so (10.14) will apply
from the present time on as well.

If the policy were to order $S_1 - s$ each month, where $S_1 \ne S_0$, then
the long-term average monthly cost would be, after a calculation similar
to the above,

$$\int_0^\infty I(S_1 - x)\, p(x)\, dx \tag{10.15}$$

Since $S_0$ describes the optimal policy, (10.15) must exceed (10.14). The
conclusion is that $S_0$ is that value of $S$ which minimizes the known
function

$$\int_0^\infty I(S - x)\, p(x)\, dx \tag{10.16}$$
In this problem, the function $I(s)$, the expected cost for the present
month starting with stock level $s$, is readily computed by the methods
of Chapter Four. There are two different expressions for $I(s)$, depending
on whether $s \ge 0$ or $s < 0$ (back orders).

$$I(s) = C_1 \int_0^s \left( s - \frac{x}{2} \right) p(x)\, dx + \int_s^\infty \left[ \frac{C_1 s^2}{2x} + \frac{C_2 (x - s)^2}{2x} \right] p(x)\, dx, \qquad s \ge 0 \tag{10.17}$$

$$I(s) = C_2 \int_0^\infty \left( \frac{x}{2} - s \right) p(x)\, dx = C_2 \left( \frac{\bar{x}}{2} - s \right), \qquad s < 0 \tag{10.18}$$

where $\bar{x}$ = average monthly demand.

To obtain the value $S_0$ that minimizes (10.16), we shall require $I'(s)$.
Differentiation of the above expressions for $I(s)$ yields, after
simplification,

$$I'(s) = \begin{cases} (C_1 + C_2) \displaystyle\int_0^s p(x)\, dx + s\,(C_1 + C_2) \displaystyle\int_s^\infty \frac{p(x)}{x}\, dx - C_2, & s \ge 0 \\[2ex] -C_2, & s < 0 \end{cases} \tag{10.19}$$

Differentiating (10.16) with respect to $S$ and putting the result equal
to zero gives

$$\int_0^\infty I'(S_0 - x)\, p(x)\, dx = 0$$

For use with (10.19), we write this as

$$\int_0^{S_0} I'(S_0 - x)\, p(x)\, dx + \int_{S_0}^\infty I'(S_0 - x)\, p(x)\, dx = 0$$

and obtain

$$\int_0^{S_0} \left[ (C_1 + C_2) \int_0^{S_0 - x} p(y)\, dy + (S_0 - x)(C_1 + C_2) \int_{S_0 - x}^\infty \frac{p(y)}{y}\, dy - C_2 \right] p(x)\, dx + \int_{S_0}^\infty (-C_2)\, p(x)\, dx = 0$$

This may be rewritten as

$$\frac{C_2}{C_1 + C_2} = \int_0^{S_0} \left[ \int_0^{S_0 - x} p(y)\, dy + (S_0 - x) \int_{S_0 - x}^\infty \frac{p(y)}{y}\, dy \right] p(x)\, dx \tag{10.20}$$

From this equation, the optimal value $S_0$ may be obtained numerically.

The reader can verify, using the methods of Chapter Four, that this
equation for $S_0$ is precisely the equation that $S_0$ satisfies if we set out to
minimize expected costs in the month after delivery of the order we are
about to place. Hence in this situation the optimal short-term and
long-term policies coincide. The reason for this is that the stock
level immediately after a delivery is made depends only on the $S$
chosen for that particular delivery and is independent of all previous $S$.


Example 4 

A plant manager has to schedule regular-time production on the 15th
of each month for the coming month. To make his decision, he has at
his disposal the following data:

(i) the inventory level $s$ on the 1st of the following month.

(ii) the probability density $p(x)$ for the monthly demand $x$.

(iii) the cost of holding inventory, $c$ per unit per month.

(iv) the cost of regular-time production, $a$ per unit.

(v) the cost of overtime production, $b\ (> a)$ per unit.

By the time the first of the month comes, he knows exactly what the
demand $x$ will be, and schedules just enough overtime, if necessary,
so that all orders will be filled.

Both regular and overtime production as well as the demand can be
considered as taking place uniformly during the month. How much
regular production should be scheduled on the 15th, so as to minimize
the long-term average monthly cost?

SOLUTION

Let $q$ be the amount of production scheduled for regular time. If
the demand $x$ is less than $s + q$, inventory at time $t$ during the month
($t$ ranging from 0 to 1) will be

$$s + qt - xt$$

If the demand $x$ is greater than $s + q$, an amount $x - (s + q)$ will be
produced during overtime; and inventory at time $t$ will be

$$s + qt - xt + [x - (s + q)]t = s(1 - t)$$

By integration with respect to $t$, we find that the average inventory
levels during the month will be

$$s + \tfrac{1}{2}(q - x) \quad \text{if } x \le s + q; \qquad \frac{s}{2} \quad \text{if } x > s + q$$

If we write $S = s + q$, the average inventory is

$$\tfrac{1}{2}(S + s - x) \quad \text{if } x \le S; \qquad \frac{s}{2} \quad \text{if } x > S$$

Hence, denoting the expected cost during the following month by
$I(s, S)$, we have

$$I(s, S) = \frac{c}{2} \int_0^S (S + s - x)\, p(x)\, dx + \frac{c}{2} \int_S^\infty s\, p(x)\, dx + a(S - s) + b \int_S^\infty (x - S)\, p(x)\, dx \tag{10.21}$$

The first two terms in (10.21) are inventory holding costs; the third
term is the cost of regular production; and the last term is the cost of
overtime production.

If we were concerned only with costs during one month, we would
choose $S = \hat{S}$, where $\hat{S}$ minimizes $I(s, S)$ in the range $s \le S < \infty$. To
determine $\hat{S}$, we find the absolute minimum of $I(s, S)$ for the given $s$
by calculus, and see whether the minimizing $S$ exceeds $s$. If so, we
would take the minimizing $S$ as $\hat{S}$; otherwise, we would take $\hat{S} = s$. By
use of the formulas of Appendix II, we find that

$$\frac{\partial I}{\partial S} = \frac{c}{2} \int_0^S p(x)\, dx + a - b \int_S^\infty p(x)\, dx$$

which may be rewritten as

$$\frac{\partial I}{\partial S} = \left( \frac{c}{2} + b \right) \int_0^S p(x)\, dx + a - b \tag{10.22}$$

The condition on $S$ for $I(s, S)$ to be a minimum becomes

$$\int_0^{\hat{S}} p(x)\, dx = \frac{b - a}{b + c/2} \tag{10.23}$$

It may be verified that (10.23) yields a minimum for $I(s, S)$. Thus
our policy would be to produce $\hat{S} - s$ in regular time if $\hat{S}$ exceeds $s$, and
nothing in regular time if $\hat{S}$ is less than $s$. We will now show that this
policy does not minimize the long-term average monthly cost.

Let $f_n(s)$ be the minimum achievable discounted cost (expected) for
$n$ months, when we start with an inventory $s$, and costs are discounted
by a factor $v\ (< 1)$ each month. Then $f_n(s)$ must satisfy

$$f_n(s) = \min_{S \ge s} \left\{ I(s, S) + v \int_0^S f_{n-1}(S - x)\, p(x)\, dx + v \int_S^\infty f_{n-1}(0)\, p(x)\, dx \right\} \tag{10.24}$$

The first term in braces is the expected cost for the first month; the
second term is the portion of the discounted cost in the remaining $(n - 1)$
months ascribable to the possibility $x < S$. If $x > S$, then overtime
will insure that the following month starts with zero inventory; hence
the third term.

If we assume (as can be proved) that the sequence of functions $f_n(s)$
tends to a limit function $f(s)$ as $n$ tends to infinity, then $f(s)$ must
satisfy the following functional equation:

$$f(s) = \min_{S \ge s} \left\{ I(s, S) + v \int_0^S f(S - x)\, p(x)\, dx + v \int_S^\infty f(0)\, p(x)\, dx \right\} \tag{10.25}$$

As in the previous example, we shall obtain the minimizing value $S_0(s)$
by a somewhat devious argument.
We first differentiate the expression in braces with respect to $S$
and set the result equal to zero:

$$\frac{\partial I}{\partial S} + v \int_0^S f'(S - x)\, p(x)\, dx = 0$$

Since $\partial I / \partial S$ is independent of $s$, as is the second term above, we deduce
that, for sufficiently small $s$, the optimizing $S_0$ is independent of $s$.
Returning to (10.25), we have

$$f(s) = I(s, S_0) + v \int_0^{S_0} f(S_0 - x)\, p(x)\, dx + v \int_{S_0}^\infty f(0)\, p(x)\, dx \tag{10.26}$$

for $s < S_0$. From (10.21), we find that $I(s, S)$ may be broken down into
the sum of $[c/2 - a]s$ and a function of $S$. Specifically, for $S = S_0$,

$$I(s, S_0) = \left[ \frac{c}{2} - a \right] s + S_0 \left[ \left( \frac{c}{2} + b \right) \int_0^{S_0} p(x)\, dx + a - b \right] - \frac{c}{2} \int_0^{S_0} x\, p(x)\, dx + b \int_{S_0}^\infty x\, p(x)\, dx \tag{10.27}$$

Symbolically, we write (10.26) as

$$f(s) = \left( \frac{c}{2} - a \right) s + g(S_0) \tag{10.28}$$

where $g(S_0)$ is the sum of the last two terms on the right side of (10.26)
and the last three terms on the right side of (10.27). From (10.28), we
have

$$f(S_0 - x) = \left( \frac{c}{2} - a \right)(S_0 - x) + g(S_0), \qquad f(0) = g(S_0) \tag{10.29}$$

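Substituting (10.27) and (10.29) in the right side of (10.26) gives

$$\begin{aligned}
f(s) = {} & \left( \frac{c}{2} - a \right) s + S_0 \left[ \left( \frac{c}{2} + b \right) \int_0^{S_0} p(x)\, dx + a - b \right] \\
& - \frac{c}{2} \int_0^{S_0} x\, p(x)\, dx + b \int_{S_0}^\infty x\, p(x)\, dx \\
& + v \int_0^{S_0} \left[ \left( \frac{c}{2} - a \right)(S_0 - x) + g(S_0) \right] p(x)\, dx + v \int_{S_0}^\infty g(S_0)\, p(x)\, dx
\end{aligned}$$

This can be reduced to

$$\begin{aligned}
f(s) = {} & \left( \frac{c}{2} - a \right) s + S_0 \left[ \frac{c}{2}(1 + v) - av + b \right] \int_0^{S_0} p(x)\, dx + S_0 (a - b) \\
& - \left[ \frac{c}{2}(1 + v) - av \right] \int_0^{S_0} x\, p(x)\, dx + b \int_{S_0}^\infty x\, p(x)\, dx + v\, g(S_0)
\end{aligned} \tag{10.30}$$

Equating (10.28) and (10.30), we may solve specifically for $g(S_0)$:

$$g(S_0) = \frac{F(S_0)}{1 - v} \tag{10.31}$$

where

$$\begin{aligned}
F(S_0) = {} & S_0 \left[ \frac{c}{2}(1 + v) - av + b \right] \int_0^{S_0} p(x)\, dx + S_0 (a - b) \\
& - \left[ \frac{c}{2}(1 + v) - av \right] \int_0^{S_0} x\, p(x)\, dx + b \int_{S_0}^\infty x\, p(x)\, dx
\end{aligned} \tag{10.32}$$

Writing $1/(1 - v)$ as $1 + v + v^2 + \cdots$, we can write (10.28) as

$$f(s) = \left( \frac{c}{2} - a \right) s + F(S_0) + v\, F(S_0) + v^2 F(S_0) + \cdots$$

Thus the long-term average monthly cost, using an optimal policy $S_0$,
is just $F(S_0)$.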

If we were to use any other fixed value $S_1$ as a policy, so that each
month we scheduled regular production of $S_1 - s$, we would find in
exactly the same way that the long-term average monthly cost would
be $F(S_1)$. Since $S_0$ is optimal, $F(S_0) \le F(S_1)$; and we conclude that $S_0$
minimizes $F(S)$. Differentiating $F(S)$ with respect to $S$, and setting the
result equal to zero yields, as our condition for $S_0$,

$$\int_0^{S_0} p(x)\, dx = \frac{b - a}{\dfrac{c}{2}(1 + v) - av + b} \tag{10.33}$$

Equation (10.33) should be compared with equation (10.23) for $\hat{S}$.
The equations coincide for $v = 0$; this is to be expected, because, when
$v = 0$, we take no account of costs incurred after the first month. As
$v$ approaches 1, the policy approaches the solution $S_0$ of the equation

$$\int_0^{S_0} p(x)\, dx = \frac{b - a}{c - a + b}$$

If $s$ exceeds the solution $S_0$ of (10.33), the optimal policy is to schedule
no regular production this month. Eventually, following the optimal
policy, the stock will fall below $S_0$, and remain below. The long-term
average monthly cost, as in the previous example, is not affected by
whether $s$ is initially greater than or less than $S_0$.
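
The effect of the discount factor on the production target can be seen
with a small calculation. The sketch below evaluates the right side of the
condition $\int_0^{S_0} p(x)\, dx = (b - a) / [\tfrac{c}{2}(1 + v) - av + b]$
and, purely as an illustrative assumption, inverts it for exponentially
distributed demand. The cost figures and function names are hypothetical.

```python
import math


def critical_ratio(a, b, c, v):
    """Required probability that demand does not exceed S.

    a: regular-time cost, b: overtime cost, c: holding cost, v: discount factor.
    Setting v = 0 gives the one-month ratio (b - a) / (b + c/2).
    """
    return (b - a) / ((c / 2) * (1 + v) - a * v + b)


def production_target(a, b, c, v, mean_demand):
    """Target S for exponential demand (an assumed density, for illustration)."""
    return -mean_demand * math.log(1.0 - critical_ratio(a, b, c, v))


a, b, c, mean_demand = 10.0, 15.0, 2.0, 100.0   # hypothetical figures
for v in (0.0, 0.5, 0.9, 0.999):
    print(v, critical_ratio(a, b, c, v), production_target(a, b, c, v, mean_demand))
```

With these figures a exceeds c/2, so the ratio, and with it the target,
rises as v approaches one: the one-month policy schedules less regular
production than the long-run policy.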


As we have noted in both this example and the last one, it is a fairly
simple matter to deduce the average monthly cost for any policy
involving a fixed $S$, say $S_1$, independent of $s$. Having done so, we can
easily choose $S_1$ so as to minimize the average monthly cost. However,
it is not obvious that the optimal policy has this form unless we
establish this fact, as we have done in Examples 3 and 4. Unless we can
prove the opposite for a particular situation, we have to regard the
optimal $S$ as an undetermined function of $s$, rather than an
undetermined constant.
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PROBLEMS FOR SOLUTION 
Set XV 


1. In the situation of Example 4, show that, with a policy of having a
constant value of $S$, say $S'$, each month, the average inventory at the end of each
month will be

$$\int_0^{S'} (S' - x)\, p(x)\, dx$$

Hence show that the average monthly cost will be

$$S'(c - a + b) \int_0^{S'} p(x)\, dx - S'(b - a) + (a - c) \int_0^{S'} x\, p(x)\, dx + b \int_{S'}^\infty x\, p(x)\, dx$$

Deduce that the policy of minimizing costs in the month following the
decision about $S$ leads to an average cost per month of

$$\hat{S}\, \frac{(b - a)(c/2 - a)}{b + c/2} + (a - c) \int_0^{\hat{S}} x\, p(x)\, dx + b \int_{\hat{S}}^\infty x\, p(x)\, dx$$

where $\hat{S}$ is the value of $S$ which satisfies equation 10.23.


2. In an inventory situation, orders are placed monthly, and the costs
during the month after each order are

$$I(s, S) = I_1(s) + I_2(S)$$

where $s$ is the inventory immediately before the order is placed and $S - s$ is
the quantity ordered. [$I_1(s)$ is independent of $S$, and $I_2(S)$ is independent
of $s$.]
If $f(s)$ is the minimum expected discounted total of all future costs, and
$p(x)$ is the probability density function for monthly demand, show that the
optimal value $\hat{S}$ of $S$ satisfies

$$I_2'(\hat{S}) + v \int_0^\infty I_1'(\hat{S} - x)\, p(x)\, dx = 0$$

and that the minimum expected cost is given by

$$f(s) = I_1(s) + \frac{1}{1 - v} \left[ I_2(\hat{S}) + v \int_0^\infty I_1(\hat{S} - x)\, p(x)\, dx \right]$$

provided $s < \hat{S}$.

Deduce the equation satisfied by the $S$ that minimizes the average monthly
cost, and obtain an expression for this cost. [Hint: Reason as in Examples
3 and 4.]


3. A purchasing agent makes decisions about orders every month. If he
decides to place an order, there are ordering costs of amount $C$. The cost of
holding inventory is $C_1$ per unit per month, and shortages cost $C_2$ per unit
per month. The probability density function for monthly demand $x$ is $p(x)$.
Demand can be assumed to take place continuously during the month, and
the backlog from one month is filled as soon as next month's shipments are
received. If the agent places an order it is filled immediately.
The following notation will be used:

$s$ = inventory level immediately before a decision on ordering is made
$S$ = inventory level immediately after a shipment is received
$f_n(s)$ = minimum total expected cost during a period of $n$ months, which
starts with an inventory $s$
$I(S)$ = expected cost of inventory and shortage during a month which starts
with an inventory $S$

Show that

(a) $$f_1(s) = \min \left\{ \min_{S > s} [I(S) + C];\ I(s) \right\}$$

(b) If costs are to be minimized over one month, an order will be placed
for an amount $S_1 - s$, provided $s < s_1$, where

(1) $S_1$ satisfies $\partial I / \partial S = 0$.
(2) $I(S_1) + C = I(s_1)$; $s_1 < S_1$. If $s > s_1$, no order will be placed.

(c) For $n > 1$,

$$f_n(s) = \min \left\{ \min_{S > s} \left[ I(S) + C + \int_0^\infty f_{n-1}(S - x)\, p(x)\, dx \right];\ I(s) + \int_0^\infty f_{n-1}(s - x)\, p(x)\, dx \right\}$$

(d) If costs are to be minimized over $n$ months, an order will be placed for
an amount $S_n - s$, provided $s < s_n$, where

(1) $S_n$ satisfies

$$\frac{\partial I}{\partial S} + \int_0^\infty \frac{\partial f_{n-1}}{\partial S}(S - x)\, p(x)\, dx = 0$$

and
(2) $s_n$ satisfies

$$I(S_n) + C + \int_0^\infty f_{n-1}(S_n - x)\, p(x)\, dx = I(s_n) + \int_0^\infty f_{n-1}(s_n - x)\, p(x)\, dx$$

(e) Determine the form of $I(S)$ in terms of $C_1$, $C_2$, and $p(x)$.
(f) Sketch the graph of $f_1(s)$.

Note: The equations in (d) would have to be solved iteratively by mechanical
computer. Such iterative numerical solutions are usually needed in dynamic
programming; only in exceptional situations can an explicit solution be
obtained in terms of commonly tabulated functions.
APPENDIX I


Finite
differences

The use of the calculus of finite differences can lead to a simplification 
of notation in many numerical processes. Many of the results are 
directly analogous to those of the infinitesimal calculus; and, in fact, 
many formulas of the latter can be derived in a simple manner from the 
corresponding formulas of the finite-difference calculus. 


DEFINITION OF THE DIFFERENCE
OPERATOR Δ


We shall be concerned with functions $f(n)$ defined only for integral
values of the argument $n$.

The "first difference" of $f(n)$ is denoted by $\Delta f(n)$ and is defined by
the formula

(I.1)  $\Delta f(n) = f(n + 1) - f(n)$

Second and subsequent differences of $f(n)$ are defined by

(I.2)  $\Delta^r f(n) = \Delta\{\Delta^{r-1} f(n)\}$

(The expression $\Delta f(n)$ is read as "delta f(n)" and $\Delta^r f(n)$ is read as
"delta to the r of f(n).")

It is readily verified that

(I.3)  $\Delta^{r+1} f(n) = \Delta^r\{\Delta f(n)\} = \Delta^r f(n + 1) - \Delta^r f(n)$

Equation (I.3) becomes true for all integral $r \ge 0$ if we make the
conventions $\Delta^0$ = identity operator, $\Delta^1 = \Delta$.
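The definitions (I.1) and (I.2) translate almost directly into code. The following is a minimal sketch, assuming functions of an integer argument are represented as Python callables; the helper name `delta` and the test function `square` are illustrative choices, not part of the text.

```python
def delta(f, r=1):
    """Return the r-th forward difference of f as a new function of n.

    r = 0 gives f itself (the identity convention), r = 1 gives (I.1),
    and larger r applies (I.2) recursively.
    """
    if r == 0:
        return f
    lower = delta(f, r - 1)
    return lambda n: lower(n + 1) - lower(n)


def square(n):
    # Hypothetical example function: f(n) = n^2
    return n * n


first = [delta(square)(n) for n in range(5)]      # values of 2n + 1
second = [delta(square, 2)(n) for n in range(5)]  # constant second difference
```

For $f(n) = n^2$ the first difference is $2n + 1$ and the second difference is the constant 2, which mirrors the derivatives $2x$ and $2$ of $x^2$.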


CONDITIONS FOR A MINIMUM OF f(n) 


The function $f(n)$ will have a "local minimum" at $n_0$ provided both
the following conditions are satisfied:

$f(n_0 + 1) > f(n_0)$; i.e., $\Delta f(n_0) > 0$.

$f(n_0 - 1) > f(n_0)$; i.e., $\Delta f(n_0 - 1) < 0$.

Thus $f(n)$ will have a local minimum at $n_0$ if

(I.4)  $\Delta f(n_0 - 1) < 0 < \Delta f(n_0)$

The function $f(n)$ is said to have an absolute minimum at $n_0$ if
$f(n_0) \le f(n)$ for all $n$. Sufficient conditions for $f(n)$ to have an absolute
minimum at $n_0$ are that (I.4) be satisfied, and also that

(I.5)  $\Delta^2 f(n) \ge 0$ for all $n$

The conditions for $f(n)$ to assume a maximum at $n_0$ are analogous.
(These conditions are sufficient for a minimum; however, they are not
necessary. A precise statement of necessary conditions is more
involved, and will not be given here.)
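Conditions (I.4) and (I.5) can be checked mechanically over a range of integers. The sketch below assumes a hypothetical cost function `cost(n) = (n - 3)^2 + 5` and a finite search range; both are illustrative and do not come from the text.

```python
def delta(f):
    """First forward difference, as in (I.1)."""
    return lambda n: f(n + 1) - f(n)


def is_local_min(f, n0):
    """Condition (I.4): delta f(n0 - 1) < 0 < delta f(n0)."""
    d = delta(f)
    return d(n0 - 1) < 0 < d(n0)


def cost(n):
    # Hypothetical cost function used only for illustration
    return (n - 3) ** 2 + 5


search = range(-10, 11)
candidates = [n for n in search if is_local_min(cost, n)]
# Condition (I.5), checked only on the finite search range
convex = all(delta(delta(cost))(n) >= 0 for n in search)
```

Here `candidates` contains the single point $n_0 = 3$ and the second difference is nonnegative throughout the range, so on that range the local minimum is also the absolute minimum.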


FIRST DIFFERENCES OF COMBINATIONS 
OF FUNCTIONS 


If $f(n)$ and $g(n)$ are any two functions of $n$, then it is readily verified
that

(I.6)  $\Delta\{f(n) + g(n)\} = \Delta f(n) + \Delta g(n)$

(I.7)  $\Delta\{f(n)\, g(n)\} = f(n + 1)\, \Delta g(n) + g(n)\, \Delta f(n)$

(I.8)  $\Delta\left\{\frac{f(n)}{g(n)}\right\} = \frac{g(n)\, \Delta f(n) - f(n)\, \Delta g(n)}{g(n + 1)\, g(n)}$

Also, for any function $f(n)$,

(I.9)  $\Delta\{k f(n)\} = k\, \Delta f(n)$ for $k$ a constant

(I.10)  $\Delta\left\{\frac{1}{f(n)}\right\} = -\frac{\Delta f(n)}{f(n + 1)\, f(n)}$

A formula analogous to the formula in calculus for differentiating
$x^r$ results if we define

$n^{(r)} = n(n - 1) \cdots (n - r + 1)$  [$r$ factors]

Then

(I.11)  $\Delta n^{(r)} = r\, n^{(r-1)}$ for fixed $r$


Other formulas that are helpful in differencing particular functions
are


(I.12)  $\Delta\{n\} = 1$
(I.13)  $\Delta\{C^n_r\} = C^n_{r-1}$, $r$ fixed
(I.14)  $\Delta\{a^n\} = a^n(a - 1)$, $a$ fixed
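Formulas (I.11), (I.13), and (I.14) are easy to spot-check numerically. The sketch below is only a check over a small range of integers, not a proof; the helper names `falling`, `delta`, and `verify` are assumptions introduced here, and `math.comb` stands in for the binomial coefficient $C^n_r$.

```python
from math import comb


def falling(n, r):
    """n^(r) = n (n-1) ... (n-r+1), a product of r factors (1 when r = 0)."""
    out = 1
    for k in range(r):
        out *= n - k
    return out


def delta(f):
    """First forward difference, as in (I.1)."""
    return lambda n: f(n + 1) - f(n)


def verify(r, a, ns=range(10)):
    """Check (I.11), (I.13) and (I.14) for every n in ns."""
    d_falling = delta(lambda n: falling(n, r))
    d_comb = delta(lambda n: comb(n, r))
    d_power = delta(lambda n: a ** n)
    return (
        all(d_falling(n) == r * falling(n, r - 1) for n in ns),  # (I.11)
        all(d_comb(n) == comb(n, r - 1) for n in ns),            # (I.13)
        all(d_power(n) == a ** n * (a - 1) for n in ns),         # (I.14)
    )


results = verify(r=3, a=2)
```

All three entries of `results` are `True` for this choice of `r` and `a`.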


SUMMATION OF SERIES 


We are interested in obtaining a closed form for an expression of the
type

$\sum_{n=a}^{b} f(n) = f(a) + f(a + 1) + \cdots + f(b)$

The formula obtained will be analogous to the fundamental theorem
of the integral calculus, which evaluates

$\int_a^b f(x)\, dx$

as $F(b) - F(a)$, where $F(x)$ is an antiderivative of $f(x)$.
Let us call $F(n)$ an "antidifference" of $f(n)$ provided

(I.15)  $\Delta F(n) = f(n)$

Then

$\sum_{n=a}^{b} f(n) = \sum_{n=a}^{b} \Delta F(n)$
$\quad = F(b + 1) - F(b)$
$\quad + F(b) - F(b - 1)$
$\quad + \cdots$
$\quad + F(a + 2) - F(a + 1)$
$\quad + F(a + 1) - F(a)$
$\quad = F(b + 1) - F(a)$

Thus, if $F(n)$ is an antidifference of $f(n)$,

(I.16)  $\sum_{n=a}^{b} f(n) = \left[ F(n) \right]_a^{b+1} = F(b + 1) - F(a)$

Note that the upper limit, unlike in integration, is $b + 1$ and not $b$.
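As an illustration of (I.16), take $f(n) = n$. By (I.11) with $r = 2$, $F(n) = n^{(2)}/2 = n(n-1)/2$ is an antidifference of $f$. The sketch below compares the closed form with a direct sum; the function names and the limits 3 and 10 are illustrative assumptions.

```python
def sum_by_antidifference(F, a, b):
    """Evaluate sum_{n=a}^{b} f(n) as F(b + 1) - F(a), following (I.16)."""
    return F(b + 1) - F(a)


def F(n):
    # Antidifference of f(n) = n, from (I.11) with r = 2: n(n - 1)/2
    return n * (n - 1) // 2


a, b = 3, 10
closed = sum_by_antidifference(F, a, b)
direct = sum(range(a, b + 1))
```

Both `closed` and `direct` equal 52. Evaluating at `b` instead of `b + 1` would drop the last term, which is the point of the note on the upper limit.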


SUMMATION BY PARTS 
In calculus, a definite integral of the form

$\int_a^b f(x)\, dg(x)$

is often more easily evaluated by integration by parts, viz.:

$\int_a^b f(x)\, dg(x) = f(b)\, g(b) - f(a)\, g(a) - \int_a^b g(x)\, df(x)$

In finite differences, we have an analogous formula which follows
readily from (I.7):

(I.17)  $\sum_{n=a}^{b} f(n)\, \Delta g(n) = f(b + 1)\, g(b + 1) - f(a)\, g(a) - \sum_{n=a}^{b} g(n + 1)\, \Delta f(n)$

As an example, suppose we wish to compute

$\sum_{n=1}^{k} n x^n$

We cannot apply (I.16) directly, since we see no easy way to
antidifference the function $n x^n$. But we can antidifference $x^n$, since

$\Delta\left\{\frac{x^n}{x - 1}\right\} = x^n$

Applying (I.17) with $f(n) = n$, $g(n) = x^n/(x - 1)$, we have

$\sum_{n=1}^{k} n x^n = \frac{(k + 1)\, x^{k+1}}{x - 1} - \frac{x}{x - 1} - \sum_{n=1}^{k} \frac{x^{n+1}}{x - 1}$

$\quad = \frac{(k + 1)\, x^{k+1}}{x - 1} - \frac{x}{x - 1} - \frac{x^{k+2} - x^2}{(x - 1)^2}$

where the last step follows from (I.16).
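The closed form obtained by summation by parts can be compared with a term-by-term sum. This is a minimal sketch using exact rational arithmetic; the function names are illustrative, and the formula is only valid for $x \ne 1$.

```python
from fractions import Fraction


def closed_form(k, x):
    """Closed form of sum_{n=1}^{k} n x^n from summation by parts (x != 1)."""
    x = Fraction(x)
    return ((k + 1) * x ** (k + 1) / (x - 1)
            - x / (x - 1)
            - (x ** (k + 2) - x ** 2) / (x - 1) ** 2)


def direct(k, x):
    """The same sum evaluated term by term."""
    x = Fraction(x)
    return sum(n * x ** n for n in range(1, k + 1))


agree = all(
    closed_form(k, x) == direct(k, x)
    for k in range(1, 7)
    for x in (2, 3, Fraction(1, 2))
)
```

Using `Fraction` avoids floating-point rounding, so the two sides can be compared with exact equality.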
DIFFERENCING UNDER A SUMMATION
SIGN


In inventory applications where the stock level $z$ is regarded as a
discrete variable, we often run into the necessity for computing the
first difference of some function $C(z)$ of the form

$C(z) = \sum_{x, y, \ldots, u} f(x, y, \ldots, u; z)$

If $f(x, y, \ldots, u; z)$ has the same functional form throughout the region
of summation, and if the boundary of this region does not depend on $z$,
then we can apply (I.6) to each term separately, and obtain

$\Delta C(z) = \sum_{x, y, \ldots, u} \Delta f(x, y, \ldots, u; z)$

where the differences on the right are with respect to $z$. However, if
$f$ has different functional forms in different sectors of the region of
summation, or if the boundary of the region depends on $z$, the
computation of $\Delta C(z)$ is usually more complicated.

Let us first consider the one-dimensional case where

$C(z) = \sum_{x=a(z)}^{b(z)} f(x, z)$

Then

$C(z + 1) = \sum_{x=a(z+1)}^{b(z+1)} f(x, z + 1)$

$\quad = \sum_{x=a(z)}^{b(z)} f(x, z + 1) + \sum_{x=b(z)+1}^{b(z+1)} f(x, z + 1) - \sum_{x=a(z)}^{a(z+1)-1} f(x, z + 1)$

and

(I.18)  $\Delta C(z) = \sum_{x=a(z)}^{b(z)} \Delta f(x, z) + \sum_{x=b(z)+1}^{b(z+1)} f(x, z + 1) - \sum_{x=a(z)}^{a(z+1)-1} f(x, z + 1)$

[We are assuming that $a(z)$, $b(z)$ are increasing functions of $z$; if this is
not so some modification is required.]
Now suppose that $f(x, z)$ is defined by

$f(x, z) = f_1(x, z)$ for $x$ in the interval $0 \le x \le b(z)$
$f(x, z) = f_2(x, z)$ for $x > b(z)$

and that we wish to difference

$C(z) = \sum_{x=0}^{\infty} f(x, z)$

We have

$C(z) = \sum_{x=0}^{b(z)} f_1(x, z) + \sum_{x=b(z)+1}^{\infty} f_2(x, z)$

Applying (I.18) to both sums, we obtain

$\Delta C(z) = \sum_{x=0}^{b(z)} \Delta f_1(x, z) + \sum_{x=b(z)+1}^{\infty} \Delta f_2(x, z) + \sum_{x=b(z)+1}^{b(z+1)} [f_1(x, z + 1) - f_2(x, z + 1)]$

Thus we have our final result for single sums:

$\Delta C(z) = \sum_{x=0}^{\infty} \Delta f(x, z)$

provided that $f_1(x, z + 1)$ equals $f_2(x, z + 1)$ for all $x$ in the interval
$b(z) + 1 \le x \le b(z + 1)$.
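The single-sum result can be illustrated with a simple stock-level cost. The sketch below assumes $b(z) = z$, a holding term $f_1(x, z) = C_1 (z - x)\, p(x)$ and a shortage term $f_2(x, z) = C_2 (x - z)\, p(x)$; these functions, the unit costs, and the demand probabilities are all hypothetical choices made for illustration. Both terms vanish at $x = z + 1$ when evaluated at $z + 1$, so the matching condition holds and the difference may be taken term by term.

```python
from fractions import Fraction

# Hypothetical demand probabilities p(x) for x = 0..4 (they sum to 1)
p = [Fraction(1, 10), Fraction(2, 10), Fraction(3, 10),
     Fraction(1, 4), Fraction(3, 20)]
C1, C2 = 1, 2  # hypothetical unit holding and shortage costs


def cost(z):
    """C(z): f1 = C1 (z - x) p(x) for x <= z, f2 = C2 (x - z) p(x) for x > z."""
    hold = sum(C1 * (z - x) * p[x] for x in range(len(p)) if x <= z)
    short = sum(C2 * (x - z) * p[x] for x in range(len(p)) if x > z)
    return hold + short


def delta_termwise(z):
    """Sum of the differences of f1 and f2 taken separately in each sector."""
    return (sum(C1 * p[x] for x in range(len(p)) if x <= z)
            - sum(C2 * p[x] for x in range(len(p)) if x > z))


matches = all(cost(z + 1) - cost(z) == delta_termwise(z) for z in range(6))
```

The boundary sum in the general formula contributes nothing here, so `cost(z + 1) - cost(z)` agrees with the term-by-term difference for every `z` tested.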
We now wish to extend these results to double sums. Suppose $C(z)$
is defined by

$C(z) = \sum_{y=a(z)}^{b(z)} \sum_{x=u(y,z)}^{v(y,z)} f(x, y, z)$

and that we wish to find $\Delta C(z)$.
Let us define $g(y, z)$ by

$g(y, z) = \sum_{x=u(y,z)}^{v(y,z)} f(x, y, z)$

Then

$C(z) = \sum_{y=a(z)}^{b(z)} g(y, z)$

and, applying (I.18), we obtain

$\Delta C(z) = \sum_{y=a(z)}^{b(z)} \Delta g(y, z) + \sum_{y=b(z)+1}^{b(z+1)} g(y, z + 1) - \sum_{y=a(z)}^{a(z+1)-1} g(y, z + 1)$

Then applying (I.18) to $g(y, z)$ yields

$\Delta g(y, z) = \sum_{x=u(y,z)}^{v(y,z)} \Delta f(x, y, z) + \sum_{x=v(y,z)+1}^{v(y,z+1)} f(x, y, z + 1) - \sum_{x=u(y,z)}^{u(y,z+1)-1} f(x, y, z + 1)$

where all differences are with respect to $z$. Hence

(I.19)  $\Delta C(z) = \sum_{y=a(z)}^{b(z)} \sum_{x=u(y,z)}^{v(y,z)} \Delta f(x, y, z)$

$\quad + \sum_{y=a(z)}^{b(z)} \sum_{x=v(y,z)+1}^{v(y,z+1)} f(x, y, z + 1)$

$\quad - \sum_{y=a(z)}^{b(z)} \sum_{x=u(y,z)}^{u(y,z+1)-1} f(x, y, z + 1)$

$\quad + \sum_{y=b(z)+1}^{b(z+1)} \sum_{x=u(y,z+1)}^{v(y,z+1)} f(x, y, z + 1)$

$\quad - \sum_{y=a(z)}^{a(z+1)-1} \sum_{x=u(y,z+1)}^{v(y,z+1)} f(x, y, z + 1)$

(Again we are assuming that the boundary functions $a$, $b$, $u$, $v$ are
increasing functions of $z$.)
Now let $f(x, y, z)$ be defined (see Figure I.1) as

$f_1(x, y, z)$ in the region $[0 \le x \le u(y, z);\ 0 \le y \le b(z)]$
$f_2(x, y, z)$ in the region $[u(y, z) < x;\ 0 \le y \le b(z)]$
$f_3(x, y, z)$ in the region $[0 \le x \le v(y, z);\ b(z) < y]$
$f_4(x, y, z)$ in the region $[v(y, z) < x;\ b(z) < y]$


Figure I.1. Subregions of region of summation in two-dimensional case.

Then, writing $b$, $u$, $v$ for $b(z)$, $u(y, z)$, $v(y, z)$ in the limits of summation,

$C(z) = \sum_{y=0}^{\infty} \sum_{x=0}^{\infty} f(x, y, z)$

$\quad = \sum_{y=0}^{b} \sum_{x=0}^{u} f_1 + \sum_{y=0}^{b} \sum_{x=u+1}^{\infty} f_2 + \sum_{y=b+1}^{\infty} \sum_{x=0}^{v} f_3 + \sum_{y=b+1}^{\infty} \sum_{x=v+1}^{\infty} f_4$

Applying (I.19) to $C(z)$ yields

$\Delta C(z) = \sum_{y=0}^{b} \sum_{x=0}^{u} \Delta f_1 + \sum_{y=0}^{b} \sum_{x=u+1}^{\infty} \Delta f_2 + \sum_{y=b+1}^{\infty} \sum_{x=0}^{v} \Delta f_3 + \sum_{y=b+1}^{\infty} \sum_{x=v+1}^{\infty} \Delta f_4$

$\quad + \sum_{y=0}^{b} \sum_{x=u+1}^{u(y,z+1)} [f_1(x, y, z + 1) - f_2(x, y, z + 1)]$

$\quad + \sum_{y=b+1}^{\infty} \sum_{x=v+1}^{v(y,z+1)} [f_3(x, y, z + 1) - f_4(x, y, z + 1)]$

$\quad + \sum_{y=b+1}^{b(z+1)} \left[ \sum_{x=0}^{u(y,z+1)} f_1(x, y, z + 1) + \sum_{x=u(y,z+1)+1}^{\infty} f_2(x, y, z + 1) - \sum_{x=0}^{v(y,z+1)} f_3(x, y, z + 1) - \sum_{x=v(y,z+1)+1}^{\infty} f_4(x, y, z + 1) \right]$

It is possible for $f_1$, $f_2$, $f_3$, $f_4$ to be such that all terms except the first
four cancel out. However, we cannot state sufficient conditions for
this to happen in as simple a manner as we did for single summations.
In practical cases, where we have to obtain differences in order to
minimize some expected value, and where in the continuous analog the
function being averaged is itself continuous, we may suspect that the
first difference of the expected value in the discrete case will contain
only four terms. It is usually worth checking such problems to see if
such simplification is possible.
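Example
Obtain $\Delta C(z)$, where

$C(z) = \sum_{y=0}^{z} \sum_{x=0}^{z-y} C_1 \left( z - y - \frac{x}{2} \right) g(x)\, h(y)$

$\quad + \sum_{y=0}^{z} \sum_{x=z-y+1}^{\infty} \left[ \frac{C_1}{2x}(z - y)^2 + \frac{C_2}{2x}(x + y - z)^2 \right] g(x)\, h(y)$

$\quad + \sum_{y=z+1}^{\infty} \sum_{x=0}^{\infty} C_2 \left( \frac{x}{2} + y - z \right) g(x)\, h(y)$

(This is the cost function of Example 7, Chapter 4.)

SOLUTION
The summation is over the first quadrant of the xy plane, divided
into sectors as shown in Figure I.2.


Figure I.2. Subregions in the example.


In the notation of this section, we have

$f_1 = C_1 \left( z - y - \frac{x}{2} \right) g(x)\, h(y)$

$f_2 = \left[ \frac{C_1}{2x}(z - y)^2 + \frac{C_2}{2x}(x + y - z)^2 \right] g(x)\, h(y)$

$f_3 = C_2 \left( \frac{x}{2} + y - z \right) g(x)\, h(y)$

$b(z) = z$
$u(y, z) = z - y$
$v(y, z) = +\infty$

$\Delta C(z) = \sum_{y=0}^{z} \sum_{x=0}^{z-y} C_1\, g(x)\, h(y)$

$\quad + \sum_{y=0}^{z} \sum_{x=z-y+1}^{\infty} \left[ \frac{C_1 + C_2}{x} \left( z - y + \frac{1}{2} \right) - C_2 \right] g(x)\, h(y)$

$\quad - \sum_{y=z+1}^{\infty} \sum_{x=0}^{\infty} C_2\, g(x)\, h(y)$

$\quad + \sum_{y=0}^{z} \sum_{x=z-y+1}^{z-y+1} \left\{ C_1 \left( z + 1 - y - \frac{x}{2} \right) - \left[ \frac{C_1}{2x}(z + 1 - y)^2 + \frac{C_2}{2x}(x + y - z - 1)^2 \right] \right\} g(x)\, h(y)$

$\quad + \sum_{y=z+1}^{z+1} \left[ \sum_{x=0}^{z-y+1} C_1 \left( z + 1 - y - \frac{x}{2} \right) g(x) + \sum_{x=z-y+2}^{\infty} \left\{ \frac{C_1}{2x}(z + 1 - y)^2 + \frac{C_2}{2x}(x + y - z - 1)^2 \right\} g(x) - \sum_{x=0}^{\infty} C_2 \left( \frac{x}{2} + y - z - 1 \right) g(x) \right] h(y)$

It will be seen that all except the first three terms reduce to single
summations, and in fact, all except the first three terms vanish.
Thus

$\Delta C(z) = \sum_{y=0}^{z} \sum_{x=0}^{z-y} C_1\, g(x)\, h(y)$

$\quad + \sum_{y=0}^{z} \sum_{x=z-y+1}^{\infty} \left[ \frac{C_1 + C_2}{x} \left( z - y + \frac{1}{2} \right) - C_2 \right] g(x)\, h(y)$

$\quad - \sum_{y=z+1}^{\infty} \sum_{x=0}^{\infty} C_2\, g(x)\, h(y)$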
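The three-term result of the example can be checked numerically against a direct difference $C(z + 1) - C(z)$. The sketch below assumes the three sector functions are $C_1(z - y - x/2)$, $[C_1(z - y)^2 + C_2(x + y - z)^2]/(2x)$, and $C_2(x/2 + y - z)$, each weighted by $g(x)\,h(y)$; the unit costs and the finite distributions `g` and `h` are hypothetical values chosen only for illustration.

```python
from fractions import Fraction as Fr

C1, C2 = Fr(1), Fr(4)                              # hypothetical unit costs
g = [Fr(1, 10), Fr(2, 10), Fr(3, 10), Fr(4, 10)]   # hypothetical g(x), x = 0..3
h = [Fr(1, 2), Fr(1, 4), Fr(1, 4)]                 # hypothetical h(y), y = 0..2


def cell(x, y, z):
    """Cost before weighting by g(x) h(y), chosen by sector."""
    if y > z:                                      # sector of f3
        return C2 * (Fr(x, 2) + y - z)
    if x <= z - y:                                 # sector of f1
        return C1 * (z - y - Fr(x, 2))
    # sector of f2; here x > z - y >= 0, so x >= 1
    return (C1 * (z - y) ** 2 + C2 * (x + y - z) ** 2) / (2 * x)


def total_cost(z):
    return sum(cell(x, y, z) * g[x] * h[y]
               for x in range(len(g)) for y in range(len(h)))


def delta_three_terms(z):
    """The three surviving terms of the difference, summed sector by sector."""
    total = Fr(0)
    for y in range(len(h)):
        for x in range(len(g)):
            if y > z:
                term = -C2
            elif x <= z - y:
                term = C1
            else:
                term = (C1 + C2) * (z - y + Fr(1, 2)) / x - C2
            total += term * g[x] * h[y]
    return total


matches = all(total_cost(z + 1) - total_cost(z) == delta_three_terms(z)
              for z in range(6))
```

Agreement for every tested `z` is consistent with the statement that the boundary terms vanish, because the sector functions coincide along the sector boundaries.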
APPENDIX II


Differentiation
of integrals


Let

$F(z) = \int_{a(z)}^{b(z)} f(x, z)\, dx$

It is not difficult to verify from first principles that, if $f(x, z)$ possesses
a continuous derivative with respect to $z$ throughout the region

$c \le z \le d$
$a(z) \le x \le b(z)$

and if the derivatives $da/dz$ and $db/dz$ exist, then

(II.1)  $\frac{dF}{dz} = \int_{a(z)}^{b(z)} \frac{\partial f}{\partial z}\, dx + f(b, z)\, \frac{db}{dz} - f(a, z)\, \frac{da}{dz}$

for all values of $z$ in the interval $c \le z \le d$.
It is obvious that, if $a$ and $b$ are constants,

$\frac{dF}{dz} = \int_a^b \frac{\partial f}{\partial z}\, dx$

and it can be shown that, provided the integral on the right converges,
the result is still true when $b = \infty$ or $a = -\infty$.
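As a quick check of (II.1), consider the illustrative choice $f(x, z) = xz$ with $a(z) = 0$ and $b(z) = z$. This integrand is an assumption made here for illustration and does not come from the text. Evaluating $F(z)$ directly and then through (II.1) gives the same derivative:

```latex
\begin{align*}
F(z) &= \int_0^z xz \, dx = \frac{z^3}{2},
  \qquad \frac{dF}{dz} = \frac{3z^2}{2}, \\
\frac{dF}{dz} &= \int_0^z \frac{\partial (xz)}{\partial z}\, dx
  + f(b, z)\,\frac{db}{dz} - f(a, z)\,\frac{da}{dz}
  = \frac{z^2}{2} + z^2 \cdot 1 - 0 = \frac{3z^2}{2}.
\end{align*}
```

The boundary term $f(b, z)\, db/dz = z^2$ accounts for two thirds of the derivative here, so differentiating under the integral sign alone would give the wrong answer.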
In operations research, particularly in inventory problems, we often
have occasion to differentiate functions $F(z)$ defined by

(II.2)  $F(z) = \int_0^{b(z)} f_1(x, z)\, dx + \int_{b(z)}^{\infty} f_2(x, z)\, dx$

If $f_1(x, z)$ possesses a continuous derivative with respect to $z$ in the
region

$c \le z \le d$
$0 \le x \le b(z)$

and if $f_2(x, z)$ has a continuous derivative with respect to $z$ in the region

$c \le z \le d$
$x \ge b(z)$

then, using (II.1), we see that

(II.3)  $\frac{dF}{dz} = \int_0^{b(z)} \frac{\partial f_1}{\partial z}\, dx + \int_{b(z)}^{\infty} \frac{\partial f_2}{\partial z}\, dx + \frac{db}{dz} \left[ f_1(b, z) - f_2(b, z) \right]$

If the expression in brackets in (II.3) is identically zero, then clearly
$dF/dz$ may be obtained simply by differentiating the expression (II.2)
for $F(z)$ under the integral signs. That is,

(II.4)  $\frac{dF}{dz} = \int_0^{b(z)} \frac{\partial f_1}{\partial z}\, dx + \int_{b(z)}^{\infty} \frac{\partial f_2}{\partial z}\, dx$

provided

(II.5)  $f_1[b(z), z] = f_2[b(z), z]$

A suggestive way of interpreting (II.4) and (II.5) is to rewrite $F(z)$ as

(II.6)  $F(z) = \int_0^{\infty} f(x, z)\, dx$

where

(II.7)  $f(x, z) = f_1(x, z)$ for $0 \le x \le b(z)$
        $f(x, z) = f_2(x, z)$ for $b(z) < x < \infty$

Then $dF/dz$ is given by (II.4) provided the function $f(x, z)$ is a
continuous function of $x$, which possesses a continuous derivative with
respect to $z$ except possibly at points on the curve $x = b(z)$.
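A typical inventory use of (II.4) and (II.5) can be sketched as follows. The integrands below are illustrative assumptions, not taken from the text: a holding cost $C_1$ per unit left over, a shortage cost $C_2$ per unit short, a demand density $p(x)$ that integrates to 1 over $[0, \infty)$, and $b(z) = z$.

```latex
\begin{align*}
F(z) &= \int_0^{z} C_1 (z - x)\, p(x)\, dx + \int_{z}^{\infty} C_2 (x - z)\, p(x)\, dx, \\
f_1(z, z) &= f_2(z, z) = 0 \quad \text{so that (II.5) holds,} \\
\frac{dF}{dz} &= C_1 \int_0^{z} p(x)\, dx - C_2 \int_{z}^{\infty} p(x)\, dx, \\
\frac{dF}{dz} = 0 \;&\Longrightarrow\; \int_0^{z} p(x)\, dx = \frac{C_2}{C_1 + C_2}.
\end{align*}
```

Because both integrands vanish on the curve $x = z$, no boundary term appears and the derivative is obtained by differentiating under the two integral signs.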

The result (II.3) can be extended to double integrals by applying it
twice, as follows. Let

(II.8)  $G(z) = \int_{a(z)}^{b(z)} \int_{u(y,z)}^{v(y,z)} f(x, y, z)\, dx\, dy$

Sufficient conditions for the repeated application of (II.3) are that the
derivatives $\partial f/\partial z$, $\partial u/\partial z$, and $\partial v/\partial z$ exist and be continuous with respect
to $z$ throughout the region of integration. Then, by writing

$F(y, z) = \int_{u(y,z)}^{v(y,z)} f(x, y, z)\, dx$

we have

$G(z) = \int_{a(z)}^{b(z)} F(y, z)\, dy$

Then

$\frac{dG}{dz} = \int_{a(z)}^{b(z)} \frac{\partial F}{\partial z}\, dy + F(b, z)\, \frac{db}{dz} - F(a, z)\, \frac{da}{dz}$

The result, which is analogous to (II.1), is

(II.9)  $\frac{dG}{dz} = \int_{a(z)}^{b(z)} \int_{u(y,z)}^{v(y,z)} \frac{\partial f}{\partial z}\, dx\, dy + \int_{a(z)}^{b(z)} \left[ f(v, y, z)\, \frac{\partial v}{\partial z} - f(u, y, z)\, \frac{\partial u}{\partial z} \right] dy$

$\quad + \frac{db}{dz} \int_{u(b,z)}^{v(b,z)} f(x, b, z)\, dx - \frac{da}{dz} \int_{u(a,z)}^{v(a,z)} f(x, a, z)\, dx$

We shall now state without proof the double-integral analog of (II.4)
and (II.5). Let a region $A$ of the xy plane be divided into disjoint
subregions $A_i$, where $\bigcup_i A_i = A$; and suppose that for each subregion $A_i$
there corresponds a function $f_i(x, y, z)$. Suppose $F(z)$ is defined by

(II.10)  $F(z) = \sum_i \iint_{A_i} f_i(x, y, z)\, dx\, dy$

Then we may calculate $dF/dz$ by

(II.11)  $\frac{dF}{dz} = \sum_i \iint_{A_i} \frac{\partial f_i}{\partial z}\, dx\, dy$

provided the following two conditions hold:

(a) The boundary of the entire region $A$ does not depend on $z$.
(b) If $A_i$ and $A_j$ are adjacent subregions, then $f_i(x, y, z)$ equals
$f_j(x, y, z)$ at all points $(x, y)$ along the common boundary of $A_i$ and $A_j$.
APPENDIX III


Row
operations


Each step in the application of the simplex method (see Chapter
Eight) was found to involve the solution of a system of m equations in
n (> m) unknowns. This was done efficiently, in the simplex tableau, by
certain row operations performed on the coefficient matrix. This
appendix gives the details of this method for solving systems of linear
equations.

First, let us consider a situation where there are as many
independent equations as there are unknowns. In such a situation, there
will be a unique numerical solution for the system. Consider the
system

$x_1 + x_2 = 3$
$-x_1 + x_2 + 2x_3 = 11$
$3x_2 - x_3 = 1$

A common method of solving such systems is the method of
elimination. If the second equation is replaced by the sum of the first and
second equations, the resulting system is

$x_1 + x_2 = 3$
$2x_2 + 2x_3 = 14$
$3x_2 - x_3 = 1$

This system will have the same solution as the original system.
Continuing, we replace the second equation by the sum of the second
equation and twice the third, obtaining

$x_1 + x_2 = 3$
$8x_2 = 16$
$3x_2 - x_3 = 1$

Dividing the second equation by 8, we get

$x_1 + x_2 = 3$
$x_2 = 2$
$3x_2 - x_3 = 1$

Continuing in this manner, we finally obtain the equivalent system

$x_1 = 1$
$x_2 = 2$
$x_3 = 5$

which is also an explicit solution of the original system.

It can be seen that all the operations on equations in the elimination
method can be considered as operations on rows of the following matrix,
which represents the original system:

 1   1   0 |  3
-1   1   2 | 11
 0   3  -1 |  1

For, if we add the first row to the second, we get

 1   1   0 |  3
 0   2   2 | 14
 0   3  -1 |  1

Add twice the third row to the second:

 1   1   0 |  3
 0   8   0 | 16
 0   3  -1 |  1

Divide the second row by 8:

 1   1   0 |  3
 0   1   0 |  2
 0   3  -1 |  1

Subtract the second row from the first:

 1   0   0 |  1
 0   1   0 |  2
 0   3  -1 |  1

Subtract three times the second row from the third:

 1   0   0 |  1
 0   1   0 |  2
 0   0  -1 | -5

Multiply the third row by -1:

 1   0   0 |  1
 0   1   0 |  2
 0   0   1 |  5

Re-interpret the matrix as a set of equations:

$x_1 = 1$
$x_2 = 2$
$x_3 = 5$

which is the desired solution.
The essential feature in solution by matrices is that by a succession 
of row operations of either of the following types: 
(a) multiplying or dividing all elements of a row by some constant, 
(b) replacing a row by the sum of that row and a multiple of some 
other row, 
we eventually can reach a matrix with ones and zeros in appropriate 
positions, from which a solution to the original system may be read off. 
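The two row operations can be applied systematically by a short routine. The following is a minimal sketch, assuming a square nonsingular system stored as an augmented matrix; the function name `gauss_jordan` is illustrative. It also swaps rows when a zero appears on the diagonal, a third operation that is an addition here and is not among the two types listed above. The order of operations differs from the hand computation, but the final matrix is the same.

```python
from fractions import Fraction


def gauss_jordan(aug):
    """Reduce an augmented matrix [A | b] (A square, nonsingular) to [I | x]."""
    m = [[Fraction(v) for v in row] for row in aug]
    n = len(m)
    for col in range(n):
        # Find a row at or below the diagonal with a nonzero entry in this
        # column and swap it into place (an extra operation, see lead-in).
        pivot = next(r for r in range(col, n) if m[r][col] != 0)
        m[col], m[pivot] = m[pivot], m[col]
        # Type (a): divide the pivot row by its leading element.
        lead = m[col][col]
        m[col] = [v / lead for v in m[col]]
        # Type (b): subtract a multiple of the pivot row from every other row.
        for r in range(n):
            if r != col and m[r][col] != 0:
                factor = m[r][col]
                m[r] = [vr - factor * vc for vr, vc in zip(m[r], m[col])]
    return [row[-1] for row in m]


# The three-equation system solved above
solution = gauss_jordan([[1, 1, 0, 3],
                         [-1, 1, 2, 11],
                         [0, 3, -1, 1]])
```

Exact `Fraction` arithmetic is used so that the ones and zeros in the final matrix are exact rather than approximate.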
Now consider a situation where there are more unknowns than
equations (as is the case at each iterative step of the simplex method).
If there are m independent equations in n unknowns, and if n > m, we
know from algebra that there is no unique numerical solution, but that
any designated set of m of the variables can usually be expressed in
terms of the remaining n - m variables. The method of elimination, or
its matrix analog, may be used exactly as in the case where m = n. The
following example will illustrate the method.


Example
Given the following system of three equations in five unknowns:

$x_1 + 2x_2 + x_4 = 7$
$x_2 + x_3 + x_4 + x_5 = 24$
$x_1 - 3x_3 + 2x_5 = 8$

Solve for $x_1$, $x_3$, and $x_5$ in terms of the remaining variables.


SOLUTION
The matrix representing the original system is:

 1   2   0   1   0 |  7
 0   1   1   1   1 | 24
 1   0  -3   0   2 |  8

Subtract row 1 from row 3:

 1   2   0   1   0 |  7
 0   1   1   1   1 | 24
 0  -2  -3  -1   2 |  1
Subtract (1/2) (third row) from second row:

 1    2    0     1    0 |  7
 0    2   5/2   3/2   0 | 47/2
 0   -2   -3    -1    2 |  1

Add (6/5) (second row) to third row:

 1    2    0     1    0 |  7
 0    2   5/2   3/2   0 | 47/2
 0   2/5   0    4/5   2 | 146/5

Multiply second row by 2/5 and third row by 1/2:

 1    2    0     1    0 |  7
 0   4/5   1    3/5   0 | 47/5
 0   1/5   0    2/5   1 | 73/5


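Re-interpreting the last matrix as a set of equations, we have

$x_1 + 2x_2 + x_4 = 7$

$\frac{4}{5} x_2 + x_3 + \frac{3}{5} x_4 = \frac{47}{5}$

$\frac{1}{5} x_2 + \frac{2}{5} x_4 + x_5 = \frac{73}{5}$

which yields as a solution to the problem as stated

$x_1 = 7 - 2x_2 - x_4$

$x_3 = \frac{47}{5} - \frac{4}{5} x_2 - \frac{3}{5} x_4$

$x_5 = \frac{73}{5} - \frac{1}{5} x_2 - \frac{2}{5} x_4$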
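The same row operations can be organized around a designated set of variables. The sketch below assumes zero-based column indices for the variables to be solved for, so $x_1$, $x_3$, $x_5$ become columns 0, 2, 4; the function name `solve_for` is illustrative, and the row swap is again an addition beyond the two operation types described in this appendix.

```python
from fractions import Fraction


def solve_for(aug, basic):
    """Row-reduce [A | b] so the columns listed in `basic` form an identity."""
    m = [[Fraction(v) for v in row] for row in aug]
    for row_idx, col in enumerate(basic):
        # Pick a row not yet used as a pivot with a nonzero entry in `col`.
        pivot = next(r for r in range(row_idx, len(m)) if m[r][col] != 0)
        m[row_idx], m[pivot] = m[pivot], m[row_idx]
        # Scale the pivot row so the designated variable has coefficient 1.
        lead = m[row_idx][col]
        m[row_idx] = [v / lead for v in m[row_idx]]
        # Eliminate the designated variable from all other rows.
        for r in range(len(m)):
            if r != row_idx and m[r][col] != 0:
                factor = m[r][col]
                m[r] = [a - factor * b for a, b in zip(m[r], m[row_idx])]
    return m


system = [[1, 2, 0, 1, 0, 7],
          [0, 1, 1, 1, 1, 24],
          [1, 0, -3, 0, 2, 8]]
reduced = solve_for(system, basic=[0, 2, 4])
```

Each row of `reduced` gives one designated variable in terms of $x_2$ and $x_4$; for instance the second row reads $\frac{4}{5} x_2 + x_3 + \frac{3}{5} x_4 = \frac{47}{5}$.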


Index 


Algebraic identities, sum of first r 
natural numbers, 26 
sum of squares of first r natural num- 
bers, 26 
sum to infinity of ny", 26 
Allocation, applications of linear pro- 
gramming, 236-246 
assignment problem, 184, 185-194 
linear programming (simplex), 183-- 
184, 220-249 
references, 249 
transportation problem, 184, 194- 
220 
Arrivals (in queueing), distribution,
126-127
mean arrival rate, 127
Artificial variable (in simplex method),
234-236
Assignment problem, problems for solution,
193-194
solution procedure, 185-192
specialized to traveling salesman
problem, 265
statement, 185


Beta distribution, 32 
Bidding, 172-175 
Bias (in sampling), 42-43 


Binomial distribution, derivation, 14- 
15 
form, 23 
mean and variance, 23 
sampling from, 45-46 


Cell evaluations (in transportation prob- 
lem), 203-207 
Competitive bidding, 172-175 
Competitive games, approximate solu- 
tion, 170-171 
definition, 156-157 
graphical solution, 180-181 
problems for solution, 180-181 
references, 182 
saddle-point games, 158-159 
solution defined, 157 
solution procedure, 157-170 
value, 157 
Competitive situations other than 
games, bidding, 172-175 
marketing, 176-180 
Composite events, 8-12 
Conditional probability, conditional 
density function, 132 
conditional mean, 132 
definition, 11-12 
Continuous probability distributions,
beta, 32
cumulative, 29-30
definition, 29-30 
exponential, 31, 127-128, 144 
gamma, 32, 56-57 
mean, 30 
normal, 31, 55-56 
rectangular, 31 
variance, 30 
Cumulative distribution function, as 
basis for Monte Carlo procedure, 
59-60 
continuous, 29-30 
discrete, 24 
Curve fitting, 53-58 


Density function, 30, 37 
Departing variable (in simplex method), 
227, 229-230 
Differentiation of integrals, formulas for, 
304-306 
used, 33, 35, 98, 120, 173, 288 
Dimensional checks, 79 
Discounted costs, 105-107, 281-282 
Discount rate, 106 
Discrete probability distributions, bi- 
nomial, 14-15, 23, 45-46 
cumulative, 24 
definition, 20-21 
geometric, 23-24, 27 
mean, 21-22 
Poisson, 23, 58, 87, 127-128 
uniform, 23 
variance, 21-22 
Dominant strategy, 167 
Dynamic programming, finite number of 
decisions, 271-281 
infinite number of decisions, 281-291 
introductory, 270 
“principle of optimality,” 272 
problems for solution, 279-281, 292- 
293 
references, 293 


Economic lot size formula, 72 
Effectiveness, definition, 1-2 
measure of, 2, 126, 281-282 
Elapsed time (in sequencing), 251 
Elimination, 227, 307-311 


Entering variable (in simplex method), 
227, 229 
Enumeration, 183, 250, 265, 276 
Erlang distribution of service time, 143-145
Evaluations, in simplex method, 228, 
236 
in transportation method, 203-207 
Events, composite, 8-12 
independent, 12 
intersection of, 9 
mutually exclusive, 10 
simple, 5-8 
union of, 9 
Expected costs, 81 
Exponential distribution, as special case 
of Erlang, 144 
exponential service times, 127-128 
form, 31 
mean and variance, 31 


Feasible solution, sequencing problem, 
250-253, 260 
simplex problem, 226 
transportation problem, 195-196 
Finite differences, elements of, 294-303 
used, 84, 88, 95, 107, 112 
Finite multiplier (in sampling), 43 
Frequency function, 21, 28 
Functional equation, 283 


Game theory, see Competitive games 
Gamma distribution, 32, 56-57 
Gantt chart, 254, 262 
Geometric distribution, 23-24, 27 
Graphical solutions, in game theory,
180-181
in linear programming, 247
in sequencing, 262-263
Group replacement, 108 


Holding cost of inventory, 70-71 


Independence of cells in a matrix, 202 
Infinitesimal allocation, 213 
Intersection of events, 9 
Inventory, costs, 70-71, 75
dynamic programming approach,
282-291
economic lot size formula, 72
introductory, 70-71
problems for solution, 80-81, 99-101
references, 101
with known demand, 71-80
with probabilistic demand, 81-99
with production lead time, 93-99,
282-291


Joint density function, 37 
Joint frequency function, 27-28 
Joint probability distributions, con- 
tinuous, 36-38 
discrete, 27-29 


Lagrangian multipliers used, 48, 179, 
273 


Least squares, 54-55 

Life table, 113-114 

Linear programming, applications, 236-246
geometric solutions, 221-225, 247
introductory, 183-185, 220-291
problems for solution, 246-249
references, 249
simplex method, 225-236

Loop joining cells in a matrix, 202, 207 


Maximal assignment, 187-188 
Maximin (or minimax) criterion, 157, 
172 
Mean, of a continuous random variable,
30
of a discrete random variable, 21-22


Mean arrival rate, 127
Mean service rate, 128
Mean service time, 128
Measure of effectiveness, definition, 2
in dynamic programming, 281-282
in queueing, 126 
Minimax (or maximin) criterion, 157, 
172 
Mixed strategy, 156 
Moments, method of, 55 
Monte Carlo sampling, application to 
waiting-line problems, 146-151 
definition, 58-59 
problems for solution, 67-69 
references, 69 
routine for, 59-60 


Multichannel queues, 128-131, 137-143 
Multistage queues, 149-151 
Mutually exclusive events, 10 


Normal distribution, 31, 55-56 


Optimality, nonoptimality test in se- 
quencing, 261 
principle of, 272 
test, in simplex method, 228 
in transportation method, 203-204 
Outcome of a play of a game, 156 


Physical sampling, 41-52 
Play of a game, 156 
Poisson arrivals, 126-127 
Poisson distribution, 23, 58, 87, 127-128 
Present value (worth), 105 
Principle of optimality, 272 
Probability, conditional, 11-12 
continuous, 29-38 
discrete, 20-29 
joint, 27-29, 36-38 
meaning, 5 
of composite and related events, 8-12 
of simple events, 5-8 
problems for solution, 19-20, 38-39 
references, 39 
Probability density, 30, 37 
Problems for solution, assignment 
method, 193-194 
dynamic programming, 279-281, 292- 
293 


games and bidding, 180-181 
inventory, known demand, 80-81 
inventory, probabilistic demand, 99-101
linear programming (simplex), 246- 
249 
Monte Carlo sampling, 67-69 
physical sampling, 52-54 
probability distributions, 38-39 
probability of events, 19-20 
replacement, 122-124 
sequencing, 267-269 
transportation method, 218-220 
waiting lines, 152-154 
Production scheduling, to meet a sales
forecast, 244-246
with constant demand, 71-80
with probabilistic demand, 81-99
with production lead time, 93-99, 282-291

Program (in sequencing), 259 

Proportional sampling, 42, 47-48 

Pure strategy, 156 


Queueing, see Waiting lines 


Random numbers, normal, 60 
plain, 41, 60 
reference to tables, 69 
Random variable, continuous, 29-30 
discrete, 20-21 
mean, 21-22, 30 
variance, 21-22, 30 
Range of a sample, 60, 67 
Recessive strategy, 167 
Rectangular distribution, 31 
Recursion, 271 
Replacement, introductory, 102 
items that deteriorate, 102-108 
items that fail, 108-112 
problems for solution, 122-124 
references, 124 
staffing, 112-117 


Saddle-point games, 158-159 

Sample space, 6-12, 20-22 

Sampling, introductory, 40-41 
Monte Carlo, 58-67, 146-151 
precision of estimates, 42-43, 47-49 
problems for solution, 52-54, 67-69 
proportional, 42, 47-48 
references, 69 
simple random, 41-47 
stratified, 42, 47-52 
unbiased, 42-43 

Sequencing, graphical solutions, 262-263
introductory, 250-251
n jobs, three machines, 256-258
two machines, 254-256
problems for solution, 267-269
references, 269
two jobs, n machines, 258-263
Service times, constant, 145-147
Erlang, 143-145
exponential, 127-128
mean service rate, 128
mean service time, 128
Set up cost, 71
Shortage cost of inventory, 75 
Simple events, 5-8 
Simple random sampling, definition, 
41 
estimate of population mean, 43 
routine, 41-42 
variance of estimate, 43-44 
Simplex method, applications, 236-246 
geometric analog, 221-225 
optimality test, 228 
problems for solution, 246-249 
references, 249 
solution procedure, 227-236 
used in dynamic programming, 276 
Simulated sampling, see Monte Carlo 
sampling 
Slack variable (in simplex method), 223, 
225-226 
Solution of a game, 157 
Staffing situations, 112-117 
Standard deviation, 39, 44 
Strategy, dominant or recessive, 167 
mixed, 156 
pure, 156 
Stratified random sampling, definition, 
42 
estimate of population mean, 47 
optimal allocation to strata, 48-49 
variance of estimate, 47 


Technological feasibility, of a sequence, 
250, 260 
Transportation problem, optimality 
test, 203-204 
problems for solution, 218-220 
solution procedure, 196-218 
statement, 194-195 
Traveling salesman problem, 264-267 
Trial, 6-8, 29 


Unbiased sampling procedure, 42-43 
Uniform distribution, 23 

Union of events, 9 

Units, checks on, 79 


Value of a game, 157 
Variance, of a continuous random vari- 
able, 30 
of a discrete random variable, 21-22 
Waiting lines, constant service times,
144-146
Erlang service times, 143-146
exponential service times, 126-143
introductory, 125-126
Monte Carlo solutions, 146-151
multichannel, 128-131, 137-143
multistage, 149-151
problems for solution, 152-154
references, 154
Warehouse problem, 274-279


Zero-sum game defined, 156